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SURVEILLANCE SYSTEM AND METHOD 
HAVING AN ADAPTIVE SEQUENTIAL 
PROBABILITY FAULT DETECTION TEST 

CROSS-REFERENCE TO RELATED 5 

APPLICATIONS 

This application is a continuation patent application of 
U.S. application Ser. No. 10/095,835, filed Mar. 8, 2002, 
now U.S. Pat. No. 6,892,163. to 

STATEMENT REGARDING FEDERALLY 
SPONSORED RESEARCH AND 
DEVELOPMENT 

15 

The invention described herein was made in the perfor- 
mance of work under NASA Small Business Technology 
Transfer Research (STTR) Contract NAS8-98027, and is 
subject to the provisions of Public Law 96-517 (35 USC 
202) and the Code of Federal Regulations 48 CFR 52.227-1 1 20 
as modified by 48 CFR 1 852.227-1 1 , in which the contractor 
has elected to retain title. 

The United States Government has rights in this invention 
pursuant to Contract No. W-3 1-109 -ENG-3 8 between the 
United States Government and the University of Chicago 25 
representing Argonne National Laboratory. 

FIELD OF THE INVENTION 

The instant invention relates generally to a system and 30 
method for process fault detection using a statistically based 
decision test and, in particular, to a system and method for 
performing high sensitivity surveillance of an asset such as 
a process and/or apparatus wherein the surveillance is per- 
formed using an adaptive sequential probability (ASP) fault 35 
detection test comprised of a probability density function 
model empirically derived from a numerical analysis of 
asset operating data. 

BACKGROUND OF THE INVENTION 40 

Conventional process surveillance schemes are sensitive 
only to gross changes in the mean value of a process signal 
or to large steps or spikes that exceed some threshold limit 
value. These conventional methods suffer from either a large 45 
number of false alarms (if thresholds are set too close to 
normal operating levels) or from a large number of missed 
(or delayed) alarms (if the thresholds are set too expan- 
sively). Moreover, most conventional methods cannot per- 
ceive the onset of a process disturbance or sensor signal 50 
error that gives rise to a signal below the threshold level or 
an alarm condition. Most conventional methods also do not 
account for the relationship between measurements made by 
one sensor relative to another redundant sensor or between 
measurements made by one sensor relative to predicted 55 
values for the sensor. 

Recently, improved methods for process surveillance 
have developed from the application of certain aspects of 
artificial intelligence technology. Specifically, parameter 
estimation methods have been developed using either sta- 60 
tistical, mathematical or neural network techniques to learn 
a model of the normal patterns present in a system of process 
signals. After learning these patterns, the learned model is 
used as a parameter estimator to create one or more pre- 
dicted (virtual) signals given a new observation of the actual 65 
process signals. Further, high sensitivity surveillance meth- 
ods have been developed for detecting process and signal 
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faults by analysis of a mathematical comparison between an 
actual process signal and its virtual signal counterpart. In 
particular, such a mathematical comparison is most often 
performed on a residual error signal computed as, for 
example, the difference between an actual process signal and 
its virtual signal counterpart. 

Parameter estimation based surveillance schemes have 
been shown to provide improved surveillance relative to 
conventional schemes for a wide variety of assets including 
industrial, utility, business, medical, transportation, finan- 
cial, and biological systems. However, parameter estimation 
based surveillance schemes have in general shown limited 
success when applied to complex processes. Applicants 
recognize and believe that this is because the parameter 
estimation model for a complex process will, in general, 
produce residual error signals having a non-Gaussian prob- 
ability density function. Moreover, a review of the known 
prior-art discloses that virtually all such surveillance sys- 
tems developed to date utilize or assume a Gaussian model 
of the residual error signal probability density function for 
fault detection. Hence, a significant shortcoming of the 
known prior-art is that, inter alia, parameter estimation 
based surveillance schemes will produce numerous false 
alarms due to the modeling error introduced by the assump- 
tion of a Gaussian residual error signal probability density 
function. The implication for parameter estimation based 
surveillance schemes is that the fault detection sensitivity 
must be significantly reduced to prevent false alarms thereby 
limiting the utility of the method for process surveillance. 
An alternative for statistically derived fault detection models 
is to mathematically pre-process the residual error signals to 
remove non-Gaussian elements prior to using the residual 
error signals in the fault detection model; however this 
approach requires an excess of additional processing and 
also limits the sensitivity of the surveillance method. There- 
fore, the implication of assuming a Gaussian residual error 
signal probability density function for a parameter estima- 
tion based surveillance scheme is simply that the system 
becomes less accurate thereby degrading the sensitivity and 
utility of the surveillance method. 

Many attempts to apply statistical fault detection tech- 
niques to surveillance of assets such as industrial, utility, 
business, medical, transportation, financial, and biological 
processes have met with poor results in part because the fault 
detection models used or assumed a Gaussian residual error 
signal probability density function. 

In one specific example, a multivariate state estimation 
technique based surveillance system for the Space Shuttle 
Main Engine’s telemetry data was found to produce numer- 
ous false alarms when a Gaussian residual error fault detec- 
tion model was used for surveillance. In this case, the 
surveillance system’s fault detection threshold parameters 
were desensitized to reduce the false alarm rate; however, 
the missed alarm rate then became too high for practical use 
in the telemetry data monitoring application. 

Moreover, current fault detection techniques for surveil- 
lance of assets such as industrial, utility, business, medical, 
transportation, financial, and biological processes either fail 
to recognize the surveillance performance limitations that 
occur when a Gaussian residual error model is used or, 
recognizing such limitations, attempt to artificially conform 
the observed residual error data to fit the Gaussian model. 
This may be attributed, in part, to the relative immaturity of 
the field of artificial intelligence and computer-assisted 
surveillance with regard to real-world process control appli- 
cations. Additionally, a general failure to recognize the 
specific limitations that a Gaussian residual error model 
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imposes on fault decision accuracy for computer-assisted 
surveillance is punctuated by an apparent lack of known 
prior art teachings that address potential methods to over- 
come this limitation. In general, the known prior-art teaches 
computer-assisted surveillance solutions that either ignore 
the limitations of the Gaussian model for reasons of math- 
ematical convenience or attempt to conform the actual 
residual error data to the artificial Gaussian model, for 
example, by using frequency domain filtering and signal 
whitening techniques. 

For the foregoing reasons, there is a need for a surveil- 
lance system and method that overcomes the significant 
shortcomings of the known prior-art as delineated herein- 
above. 

BRIEF SUMMARY OF THE INVENTION 

The instant invention is distinguished over the known 
prior art in a multiplicity of ways. For one thing, an 
embodiment of the instant invention provides a surveillance 
system and method having a fault detection model of 
unconstrained probability density function form and having 
a procedure suitable for overcoming a performance limiting 
trade-off between probability density function modeling 
complexity and decision accuracy that has been unrecog- 
nized by the known prior-art. Specifically, an embodiment of 
the instant invention can employ any one of a plurality of 
residual error probability density function model forms, 
including but not limited to a Gaussian form, thereby 
allowing a surveillance system to utilize the model form best 
suited for optimizing surveillance system performance. 

Moreover, an embodiment of the instant invention pro- 
vides a surveillance system and method that uses a com- 
puter-assisted learning procedure to automatically derive the 
most suitable form of the residual error probability density 
function model by observation and analysis of a time 
sequence of process signal data and by a combination of a 
plurality of techniques. This ability enables surveillance to 
be performed by the instant invention with lower false alarm 
rates and lower missed alarm rates than can be achieved by 
the known prior-art systems and methods. 

Thus, an embodiment of the instant invention provides a 
surveillance system and method that performs its intended 
function much more effectively by enabling higher decision 
accuracy. Further, an embodiment of the instant invention is 
suitable for use with a plurality of parameter estimation 
methods thereby providing a capability to improve the 
decision performance of a wide variety of surveillance 
systems. 

In one embodiment, the instant invention provides a 
surveillance system and method that creates and uses, for the 
purpose of process surveillance, a multivariate state estima- 
tion technique parameter estimation method in combination 
with a statistical hypothesis test fault detection method 
having a probability density function model empirically 
derived from a numerical analysis of asset operating data. 

Particularly, and in one embodiment, the instant invention 
provides a surveillance system and method for providing 
surveillance of an asset such as a process and/or apparatus 
by providing training and surveillance procedures. 

In accordance with one embodiment, the instant invention 
provides a training procedure comprised of a calibrate 
parameter estimator procedure and a calibrate fault detector 
procedure. 

The calibrate parameter estimator procedure creates a 
parameter estimation model(s) and trains a parameter esti- 
mation model by utilizing training data correlative to 
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expected asset operation and, for example, utilizing a mul- 
tivariate state estimation technique (MSET) procedure. The 
calibrate parameter estimator procedure further stores this 
model as a new parameter estimation model in a process 
5 model. The process model may contain one or more param- 
eter estimation models depending upon design requirements. 
Additionally, each parameter estimation model contained in 
the process model may be created to implement any one of 
a plurality of parameter estimation techniques, 
to The calibrate fault detector procedure makes use of the 
parameter estimation models to provide estimated values for 
at least one signal parameter contained in the training data. 
Generally, the calibrate fault detector procedure will create 
a separate and distinct fault detection model for each sensor 
15 or data signal associated with the asset being monitored for 
the presence of fault conditions during the surveillance 
procedure. 

The calibrate fault detector procedure includes, for 
example, a method of fitting a standard Gaussian probability 
20 density function (PDF) to a training data residual distribu- 
tion (computed as a function of estimated process param- 
eters and the training data) and then adding successive 
higher order terms of a remainder function to the standard 
Gaussian PDF for the purpose of defining a general PDF that 
25 better fits the computed training data residual distribution. 
Other techniques for fitting a general PDF to the training 
data are similarly feasible and useful in accordance with the 
instant invention and include, for example, a technique for 
fitting a polynomial function to the training data. 

30 The training procedure is completed when all training 
data has been used to calibrate the process model. At this 
point, the process model preferably includes parameter 
estimation models and fault detection models for each 
sensor or data signal associated with the asset being moni- 
35 tored for the presence of fault conditions during the surveil- 
lance procedure. The process model is thereafter used for 
performing surveillance of the asset. 

The surveillance procedure is performed using an adap- 
tive sequential probability (ASP) fault detection test com- 
40 prised of the general probability density function model 
empirically derived from a numerical analysis of the asset 
training data. 

The surveillance procedure acquires and digitizes current 
asset data and then estimates process parameters as a func- 
45 tion of the acquired digitized current asset data and the 
parameter estimation model (s) obtained from the calibrate 
parameter estimator procedure. Then, fault detection is 
determined by first computing data residuals as a function of 
the estimated process parameters and the acquired digitized 
50 current asset data and then performing the ASP test(s) as a 
function of the fault detection models and thus, as a function 
of the fitted general PDF obtained in the calibrate fault 
detector procedure. Each ASP test returns one of three 
possible states: a not null state which rejects the probability 
55 that a null hypothesis is true and excepts an alternative 
hypothesis correlative to unexpected operation of the asset; 
a null state which accepts the probability that a null hypoth- 
esis is true and excepts the null hypothesis correlative to 
expected operation of the asset; and an in-between state 
60 which excepts neither the null hypothesis nor the alternative 
hypothesis as being true and requires more data to reach a 
conclusion. 

The results of the fault detection procedure are then 
analyzed according to the instant invention such that an 
65 alarm and/or a control action are taken when the analysis 
determines that the results indicate unexpected operation of 
the asset. 
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Hence, the instant invention is distinguished over the 
known prior-art by providing a surveillance system and 
method for performing surveillance of an asset by acquiring 
residual data correlative to expected asset operation; fitting 
a mathematical model to the acquired residual data; storing 
the mathematical model in a memory means; collecting a 
current set of observed signal data values from the asset; 
using the mathematical mode in a sequential hypotheses test 
to determine if the current set of observed signal data values 
indicate unexpected operation of the asset indicative a fault 
condition; and outputting a signal correlative to each 
detected fault condition for providing asset surveillance. 

Accordingly, the instant invention provides a new, novel 
and useful surveillance system and method having an adap- 
tive sequential probability fault detection test. 

In one embodiment of the instant invention the surveil- 
lance system and method includes an unconstrained form of 
a residual error probability density function model used in 
said surveillance system’s fault detection method. 

In one embodiment of the instant invention the surveil- 
lance system and method can perform high sensitivity 
surveillance for a wide variety of assets including industrial, 
utility, business, medical, transportation, financial, and bio- 
logical processes and apparatuses wherein such process 
and/or apparatus asset preferably has at least one pair of 
redundant actual and/or virtual signals. 

In one embodiment of the instant invention the surveil- 
lance system and method includes a statistical hypothesis 
test surveillance decision procedure that uses a fault detec- 
tion model comprised of a probability density function 
model of a residual error signal that is of an unconstrained 
form. 

In one embodiment of the instant invention the surveil- 
lance system and method creates an improved fault detection 
model for a process surveillance scheme using recorded 
operating data for an asset to train a fault detection model. 

In one embodiment of the instant invention the surveil- 
lance system and method provides an improved system and 
method for surveillance of on-line, real-time signals, or 
off-line accumulated signal data. 

In one embodiment of the instant invention the surveil- 
lance system and method provides an improved system and 
method for surveillance of signal sources and detecting a 
fault or error state of the signal sources enabling responsive 
action thereto. 

In one embodiment of the instant invention the surveil- 
lance system and method provides an improved system and 
method for surveillance of signal sources and detecting a 
fault or error state of the asset processes and apparatuses 
enabling responsive action thereto. 

In one embodiment of the instant invention the surveil- 
lance system and method provides an improved decision as 
to the accuracy or validity for at least one process signal 
parameter given an observation of at least one actual signal 
from the asset. 

In one embodiment of the instant invention the surveil- 
lance system and method provides an improved system and 
method for ultra-sensitive detection of a fault or error state 
of signal sources and/or asset processes and apparatuses 
wherein the parameter estimation technique used for the 
generation of at least one virtual signal parameter is a 
multivariate state estimation technique (MSET) having any 
one of a plurality of pattern recognition matrix operators, 
training procedures, and operating procedures. 

In one embodiment of the instant invention the surveil- 
lance system and method provides an improved system and 
method for ultra-sensitive detection of a fault or error state 
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of signal sources and/or asset processes and apparatuses 
wherein the parameter estimation technique used for the 
generation of at least one virtual signal parameter is a neural 
network having any one of a plurality of structures, training 
5 procedures, and operating procedures. 

In one embodiment of the instant invention the surveil- 
lance system and method provides an improved system and 
method for ultra-sensitive detection of a fault or error state 
of signal sources and/or asset processes and apparatuses 
to wherein the parameter estimation technique used for the 
generation of at least one virtual signal parameter is a 
mathematical process model having any one of a plurality of 
structures, training procedures, and operating procedures. 

In one embodiment of the instant invention the surveil- 
15 lance system and method provides an improved system and 
method for ultra-sensitive detection of a fault or error state 
of signal sources and/or asset processes and apparatuses 
wherein the parameter estimation technique used for the 
generation of at least one virtual signal parameter is an 
20 autoregressive moving average (ARMA) model having any 
one of a plurality of structures, training procedures, and 
operating procedures. 

In one embodiment of the instant invention the surveil- 
lance system and method provides an improved system and 
25 method for ultra-sensitive detection of a fault or error state 
of signal sources and/or asset processes and apparatuses 
wherein the parameter estimation technique used for the 
generation of at least one virtual signal parameter is a 
Kalman filter model having any one of a plurality of 
30 structures, training procedures, and operating procedures. 

In one embodiment of the instant invention the surveil- 
lance system and method provides a novel system and 
method for using at least one of a plurality of methods to 
classify the state of a residual error signal produced by the 
35 mathematical difference between two signals, said two sig- 
nals being either actual and/or predicted signals, into one of 
at least two categories. 

In one embodiment of the instant invention the surveil- 
lance system and method provides a novel system and 
40 method to classify the state of a residual error signal wherein 
said classification is made to distinguish between a normal 
signal and a abnormal signal. 

In one embodiment of the instant invention the surveil- 
lance system and method provides a novel system and 
45 method to classify the state of a residual error signal wherein 
said classification is performed using a statistical hypothesis 
test having any one of a plurality of probability density 
function models, training procedures, and operating proce- 
dures. 

50 In one embodiment of the instant invention the surveil- 
lance system and method provides a novel system and 
method to classify the state of a residual error signal wherein 
said classification is performed using a probability density 
function model having any one of a plurality of structures, 
55 training procedures, and operating procedures. 

Accordingly, having thus summarized the invention, it 
should be apparent that numerous modifications and adap- 
tations may be resorted to without departing from the scope 
and fair meaning of the present invention as set forth 
60 hereinbelow by the claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a block diagram of a surveillance system of an 
65 embodiment in accordance with the instant invention. 

FIG. 2 is a schematic functional flow diagram of an 
embodiment in accordance with the instant invention. 
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FIG. 3 is a schematic functional flow diagram of a method 
and system for training a process model consisting of at least 
one parameter estimation model and at least one fault 
detection model using recorded observations of the actual 
process signals in accordance with the instant invention. 

FIG. 4 is a schematic functional flow diagram of a method 
and system for the fault detection model training procedure 
in accordance with the instant invention. 

FIG. 5 is a schematic functional flow diagram of a 
preferred method and system for performing surveillance of 
an asset using at least one parameter estimation model and 
at least one fault detection model in accordance with the 
instant invention. 

FIG. 6 is a schematic functional flow diagram of a method 
and system for the fault detection surveillance procedure in 
accordance with the instant invention. 

FIG. 7 is a schematic functional flow diagram of a method 
and system for performing parameter estimation and fault 
detection using a redundant sensor. 

FIG. 8 is a schematic functional flow diagram of a method 
and system for performing parameter estimation and fault 
detection using a generalized parameter estimation model, 
such as a multivariate state estimation model, a neural 
network model, an analytical model, or a Kalman filter 
model. 

FIG. 9 is a flow diagram of a training and surveillance 
procedure in accordance with the instant invention. 

FIG. 10 illustrates characteristics of the null and alternate 
hypotheses for the prior-art sequential probability ratio test 
(SPRT) mean tests. 

FIG. 11 illustrates characteristics of the null and alternate 
hypotheses for the prior-art SPRT variance tests. 

FIG. 12 illustrates acquired operating data, estimated 
parameter data, and residual error data for a typical Space 
Shuttle Main Engine accelerometer. 

FIG. 13 illustrates a probability density function of the 
residual error data for a typical Space Shuttle Main Engine 
accelerometer with comparison to a Gaussian probability 
density function. 

FIG. 14 illustrates an un-optimized one-term expansion 
probability density function model of the residual error data 
for a typical Space Shuttle Main Engine accelerometer with 
comparison to the actual residual error data and a Gaussian 
probability density function. 

FIG. 15 illustrates an un-optimized two-term expansion 
probability density function model of the residual error data 
for a typical Space Shuttle Main Engine accelerometer with 
comparison to the actual residual error data and a Gaussian 
probability density function. 

FIG. 16 illustrates an un-optimized three-term expansion 
probability density function model of the residual error data 
for a typical Space Shuttle Main Engine accelerometer with 
comparison to the actual residual error data and a Gaussian 
probability density function. 

FIG. 17 lists the root mean square errors for five different 
un-optimized probability density function models computed 
for each of six Space Shuttle Main Engine accelerometers 
used for feasibility testing of a preferred embodiment in 
accordance with the instant invention. 

FIG. 18 lists the root mean square errors for five different 
optimized probability density function models computed for 
each of six Space Shuttle Main Engine accelerometers used 
for feasibility testing of a preferred embodiment in accor- 
dance with the instant invention. 

FIG. 19 illustrates the optimized two-term expansion 
probability density function model of the residual error data 
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for a typical Space Shuttle Main Engine accelerometer with 
comparison to the actual residual error data and a Gaussian 
probability density function. 

FIG. 20 illustrates the empirical false alarm rates for 
5 feasibility testing of a preferred embodiment in accordance 
with the instant invention with comparison to the prior art 
SPRT method. 

DETAILED DESCRIPTION OF THE 
io INVENTION 

Considering the drawings, wherein like reference numer- 
als denote like parts throughout the various drawing figures, 
reference numeral 10 is directed to the system according to 
15 the instant invention. 

In its essence, and referring to FIGS. 1 and 2, the system 
10 is generally comprised of a method and apparatus for 
performing high sensitivity surveillance of a wide variety of 
assets including industrial, utility, business, medical, trans- 
20 portation, financial, and biological processes and appara- 
tuses wherein such process and/or apparatus asset preferably 
has at least one distinct measured or observed signal or 
sequence comprised of characteristic data values which are 
processed by the system 10 described herein for providing 
25 ultra-sensitive detection of the onset of sensor or data signal 
degradation, component performance degradation, and pro- 
cess operating anomalies. The system 10 includes a training 
procedure 30 carried out on a computer 22 such that a 
process model 50 of an asset 12 (e.g., a process and/or 
30 apparatus) is stored in an associated memory means 24 after 
being learned from historical operating data using at least 
one of a plurality of computer-assisted procedures in accor- 
dance with the instant invention. The historical operating 
data includes a set or range of observations from expected or 
35 typical operation of the asset 12 that are acquired and 
digitized by a data acquisition means 20 and stored in a 
memory means 24 as training data 34 by using any combi- 
nation of electronic data acquisition hardware and signal 
processing software 20 known to those having ordinary skill 
40 in the art, and informed by the present disclosure. As will be 
delineated in detail infra, one hallmark of the instant inven- 
tion is the process model 50 for the asset 12 that is derived 
during the training procedure 30. 

The system 10 further includes a surveillance procedure 
45 60 wherein the stored process model 50 is used for high 
sensitivity computer-assisted surveillance of the asset 12 for 
the purpose of determining whether a process fault or failure 
necessitates an alarm or control action. The process model 
50 is comprised of a parameter estimation model 52 or 
50 collection of parameter estimation models 52 as necessary to 
provide an estimated value for each sensor or data signal 14 
of asset 12 to be monitored for the presence of fault 
conditions during the surveillance procedure 60. The pro- 
cess model 50 is further comprised of a fault detection model 
55 54 or collection of fault detection models 54 such that at 
least one fault detection model 54 is provided for each 
sensor or data signal 14 of asset 12 to be monitored for the 
presence of fault conditions during the surveillance proce- 
dure 60. The fault detection model 54 is another hallmark of 
60 the instant invention and will be delineated in further detail 
hereinbelow. 

Process Model Training Procedure: 

More specifically, and referring to FIGS. 1 and 3, the 
training procedure 30 of the system 10 includes a method 
65 and apparatus for training or preparing the process model 50 
using historical operating data from the asset 12 that has 
been acquired by the data acquisition means 20 using any 
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combination of conventional electronic data acquisition 
hardware and signal processing software as is well known in 
the art. The historical operating data is acquired in digital 
format and stored in memory means 24 using a data storage 
procedure 32 to create a training data set 34. 

The training data set 34 includes at least N discrete 
observations of the asset 12 wherein each single observa- 
tion, herein denoted Xobs, is comprised of a vector of data 
values for each signal parameter to be included in the 
process model 50. For the purposes of the training procedure 
30, the number of observations, N, acquired is at least great 
enough to adequately bound the operating state space of the 
asset 12. Thus, the training data set 34 provides a represen- 
tative sample of the signals produced by the asset 12 during 
all normal modes of operation. 

Upon acquiring the training data set 34, the unique 
training procedure 30 can be implemented in accordance 
with instant invention. 

The unique training procedure 30 is comprised of a 
calibrate parameter estimator procedure 36 and a calibrate 
fault detector procedure 38. The calibrate parameter estima- 
tor procedure 36 creates the parameter estimation model 52 
and trains the parameter estimation model 52 using the 
training data 34. The calibrate parameter estimator proce- 
dure 36 further stores this model as a new parameter 
estimation model 52 in the process model 50. 

The process model 50 may contain one or more parameter 
estimation models 52 depending upon the requirements of 
the approach taken by a designer. Continuing to refer to FIG. 
3, the training procedure 30 may be, in general, performed 
using any parameter estimation method suitable for defining 
a parameter estimation model 52 useful for estimating the 
values of one or more process signals. Methods suitable for 
the calibrate parameter estimator procedure 36 include, but 
are not limited to, a plurality of redundant sensor techniques, 
a plurality of multivariate state estimation techniques, a 
plurality of neural network techniques, a plurality of math- 
ematical model techniques, a plurality of autoregressive 
moving average techniques, and a plurality of Kalman filter 
techniques. Each parameter estimation model 52 contained 
in the process model 50 may be created to implement any 
one of a plurality of parameter estimation techniques. Fur- 
ther, the parameter estimation technique implemented for an 
individual parameter estimation model 52 is not constrained 
to be the same as the parameter estimation technique imple- 
mented for any other parameter estimation model 52 con- 
tained in the process model 50. 

One example of the calibrate parameter estimator proce- 
dure 36 would be the computation of a bias term between 
two redundant sensors wherein the parameter estimation 
model 52 used for estimating the value of one sensor during 
the surveillance procedure 60 consisted of summing the 
observed value of a second redundant sensor with a bias 
term computed during the training procedure 30 as the mean 
difference between the two sensor values over the training 
data set 34. More sophisticated examples of the training 
procedure 30 using multivariate state estimation techniques 
will be described herein below. 

Still referring to FIG. 3, the calibrate fault detector 
procedure 38 makes use of the parameter estimation models 
52 to provide estimated values for at least one signal 
parameter contained in the training data. Generally, the 
calibrate fault detector procedure 38 will create a separate 
and distinct fault detection model 54 for each sensor or data 
signal of asset 12 to be monitored for the presence of fault 
conditions during the surveillance procedure 60. As delin- 
eated infra, one hallmark of the instant invention is the fault 
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detection model 54 element of the process model 50 for the 
asset 12 that is derived during the training procedure 30. In 
particular, the instant invention encompasses a statistical 
hypothesis test type of fault detection model 54 having novel 
5 and unique characteristics and calibration procedures 
described herein including but not limited to having a 
probability density function model empirically derived from 
a numerical analysis of asset operating data. 

Continuing to refer to FIG. 3, the training procedure 30 is 
to completed when all training data 34 has been used to 
calibrate the process model 50. At this point, the process 
model 50 includes parameter estimation models 52 and fault 
detection models 54 for each sensor or data signal of asset 
12 to be monitored for the presence of fault conditions 
15 during the surveillance procedure 60. The process model 
array 50 is thereafter useful for performing surveillance of 
the asset 12. 

Referring to FIG. 4, the training procedure 30 is illus- 
trated in additional detail. A designer 16 initializes the 
20 calibrate parameter estimators procedure 36 by specifying a 
set of parameter estimator methods and settings 40. The 
parameter estimator methods and settings 40 are then used 
to operate on the training data 34 via a nominal signal 
behavior modeling procedure 42, for example using an 
25 MSET training procedure as described herein in detail, in 
order to create the parameter estimation models 52, which 
are stored in the process model 50. 

Still referring to FIG. 4, the training procedure 30 next 
proceeds to the calibrate fault detectors procedure 38 
30 wherein the parameter estimation models 52 are an input to 
the procedure. The designer 16 initializes the calibrate fault 
detectors procedure 38 by specifying a set of fault detector 
methods and settings 46. Next, the estimate process param- 
eters procedure 64 operates the parameter estimation models 
35 52 over the training data 34 to generate an estimated value 
for each monitored signal value contained in the training 
data 34. It is important that the estimate process parameters 
procedure 64 used in the calibrate fault detectors procedure 
38 be the same estimate process parameters procedure 64 
40 that will later be used in the surveillance procedure 60 
(reference FIGS. 1 and 2 for surveillance procedure 60). 
Next, A compute training data residuals procedure 44 cal- 
culates the training data residuals for each monitored signal, 
which are calculated as the difference between the training 
45 data value and the corresponding estimated data value for 
each monitored signal. The training data residuals are next 
used by a compute nominal residual probability density 
function (PDF) procedure 48 to create a fault detection 
model 54 for each monitored signal. In accordance with the 
50 instant invention, the fault detection models 54 is typically 
comprised of mathematical descriptions of the probability 
density function that best characterizes or best fits the 
training data residual for the monitored signal. The training 
data is presumed to accurately characterize the expected 
55 normal operating states of the asset 12. Therefore, the 
training data residuals are characteristic of the expected 
normal deviations between the observed signal values and 
the values estimated using the parameter estimation models 
52. The fault detection models 54 are stored in the process 
60 model 50 thereby completing the training procedure. 

One hallmark of the instant invention is the method and 
system for computing the fault detection models 54 by the 
means of the compute nominal residual probability density 
function (PDF) procedure 48. As will be described math- 
65 ematically herein below, the compute nominal residual prob- 
ability density function (PDF) procedure 48 fits a general 
open-ended probability function to the training data residu- 
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als and employs this fitted function when implementing a 
herein named Adaptive Sequential Probability (ASP) 
method and system for computing the fault detection model 
54 and thereafter employing said fault detection model 54 
for the purpose of performing a fault detection procedure 66 
of the surveillance procedure 60. 

Surveillance Procedure: 

Referring to FIG. 5, the surveillance procedure 60 is 
comprised of acquiring successive vectors of current oper- 
ating data and determining for each such observation vector 
whether the current operating data is indicative of a fault or 
failure of the asset 12. The surveillance procedure 60 further 
includes implementing an alarm or control action 70 for the 
purpose of notifying an operator and/or taking a corrective 
action in response to a detected fault or failure of the asset 
12. The surveillance procedure 60 is in general an open- 
ended data acquisition and analysis loop that continues until 
such time as the operator chooses to terminate the surveil- 
lance 74. 

More specifically, and referring to FIG. 5, the surveillance 
procedure 60 begins 58 with an acquire current operating 
data procedure 62 that employs the data acquisition and 
digitization means 20 (FIG. 1) to acquire a current set of 
signal data from the monitored asset 12. The current set of 
signal data is provided to the estimate process parameters 
procedure 64 that uses the parameter estimation models 52 
to estimate values for one or more of the current signal data 
values. The observed and estimated data are next provided 
to a perform fault detection procedure 66 that uses the fault 
detection models 54 to determine whether a fault is found 68 
in the current operating data. If a fault is found 68 is true, an 
alarm and/or control action 70 is taken. Upon completing the 
fault found procedure 68, the surveillance procedure 60 then 
repeats for the next available set of signal data for as long as 
a surveillance complete decision procedure 72 determines 
that additional surveillance data are available or terminates 
at surveillance complete step 74 when no more surveillance 
data are available or when terminated by an operator. 

Referring now to FIG. 6, the perform fault detection 
procedure 66 of surveillance procedure 60 is illustrated in 
additional detail. For each current set of signal data values 
acquired the estimate process parameters procedure 64 uses 
the parameter estimation models 52 to estimate values for 
one or more of the current signal data values. The compute 
data residuals procedure 76 performs a mathematical trans- 
formation on the acquired and estimated values to produce 
a current set of residual data values. Said mathematical 
transformation is most typically a simple mathematical 
difference, however, any appropriate transformation may be 
used including transformations that remove correlated and 
uncorrelated noise from the residual data values. The residu- 
als produced and transformed in the compute data residuals 
procedure 76 are next processed by a perform ASP tests 
procedure 78 that uses the fault detection models 54 to 
produce a ASP fault indication that is a hallmark of the 
method and system of the instant invention. Next, the fault 
found decision procedure 68 is performed on the basis of the 
ASP fault indication results produced by the perform ASP 
tests procedure 78. The fault found decision procedure 68 
may have any one of a plurality of structures and procedures, 
including but not limited to methods and systems to perform 
false alarm filtering by means of observing a time series of 
ASP fault indication results for the purposes of determining 
the actual presence of a fault. In one preferred embodiment 
of the instant invention, a conditional probability fault found 
decision procedure 68 is used to perform said false alarm 
filtering. 
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Continuing to refer to FIG. 6, the estimate process param- 
eters procedure 64 uses the parameter estimation models 52 
to estimate values for one or more of the current signal data 
values wherein the parameter estimation method used may 
5 have any one of a plurality of structures and procedures, 
including but not limited to, a plurality of redundant sensor 
techniques, a plurality of multivariate state estimation tech- 
niques, a plurality of neural network techniques, a plurality 
of mathematical model techniques, a plurality of autoregres- 
10 sive moving average techniques, and a plurality of Kalman 
filter techniques. 

Referring to FIG. 7, one possible redundant sensor tech- 
nique for the estimate process parameters procedure 64 is 
illustrated. The acquire current operating data procedure 62 
15 is used to acquire current signal data values from signals 14 
monitored from asset 12 via sensors 18. The estimated value 
for a first redundant sensor signal is computed using a 
mathematical transformation on the acquired value of a 
second redundant sensor signal. Said mathematical transfor- 
20 mation is the estimate process parameters procedure 64 that 
in this case may be a simple equivalence or may include 
biasing, de-noising or other signal processing. The compute 
data residuals procedure 76 is then performed followed by 
the perform ASP tests procedure 78 as described herein- 
25 above and further delineated hereinbelow. 

Referring to FIG. 8, one possible multivariable parameter 
estimation technique for the estimate process parameters 
procedure 64 is illustrated. The acquire current operating 
data procedure 62 is used to acquire current signal data 
30 values from signals 14 monitored from asset 12 via sensors 
18. The estimated value for one or more sensor signals is 
computed using a mathematical transformation on the 
acquired values of one or more sensor signals. Said math- 
ematical transformation is the estimate process parameters 
35 procedure 64 that in this case may implement any feasible 
parameter estimation technique or procedure, including but 
not limited to a plurality of multivariate state estimation 
techniques, a plurality of neural network techniques, a 
plurality of mathematical model techniques, and a plurality 
40 of Kalman filter techniques. The compute data residuals 
procedure 76 is then performed followed by the perform 
ASP tests procedure 78 as described hereinabove and further 
delineated hereinbelow. 

Referring again to FIG. 6, the usefulness of the instant 
45 invention is, inter alia, the improvement achieved in the 
accuracy of the fault decision procedure 68 that results from 
the improvement achieved in the accuracy of perform fault 
detection procedure 66 made possible by the novel perform 
ASP tests procedure 78 that is a hallmark of the instant 
50 invention. Improving the accuracy of the fault decision 
procedure 68 accomplishes a reduction in the number of 
false alarms sent to a process operator or control system that 
can in turn result in an erroneous alarm or control action by 
the alarm or control action procedure 70. Further, improving 
55 the accuracy of the fault decision procedure 68 accomplishes 
a reduction in the number of missed alarms thereby accom- 
plishing more timely alarm or control action by the alarm or 
control action procedure 70. The instant invention thereby 
enables improved operating safety, improved efficiency and 
60 performance, and reduced maintenance costs for a wide 
variety of industrial, utility, business, medical, transporta- 
tion, financial, and biological processes and apparatuses 
wherein such process and/or apparatus asset 12 has at least 
one characteristic data signal suitable for surveillance. 

65 In use and operation, and in one preferred form, FIGS. 1 
and 9 outline a general surveillance procedure of the system 
10 when employing the novel fault detection model 54 



US 7,082,379 B1 


13 


14 


contained in the process model 50 and the accompanying 
novel fault detection procedure 66 having the perform ASP 
tests procedure 78 . In a typical surveillance procedure, the 
asset 12 is the source of at least one process signal 14 that 
is acquired and digitized using conventional data acquisition 5 
means 20 for providing the data acquisition procedure for 
the purpose of computer-assisted surveillance. The digitized 
signal data is generally evaluated using a computer 22 
having computer software modules implementing the esti- 
mate process parameters procedure 64 , and the perform fault 10 
detection procedure 66. The estimate process parameters 
procedure 64 is used to produce an estimated signal value for 
at least one process signal 14 emanating from the asset 12 . 
The estimate process parameters procedure 64 in general 
makes use of the process model 50 stored in a memory 15 
means 24 associated with the computer 22 to produce the 
estimated signal values. The estimated signal values are then 
generally evaluated using the perform fault detection pro- 
cedure 66 to identify faults or operating anomalies in the 
asset 12 . The results of the fault detection evaluation are 20 
thereafter communicated by a conventional communications 
means 26 (as is known to those having ordinary skill in the 
art, and informed by the present disclosure) to an operator 
console 80 and/or automated process control system 82 for 
possible alarm and/or control action. 

The computer 22 along with the associated memory 
means 24 can also be employed to perform the training and 
surveillance procedures 30 , 60 as delineated hereinabove 
and to produce and store all the data associated with these 3Q 
procedures, for example, the historical training data, 
designer defined settings, and the process model. 

Multivariate State Estimation Technique (MSET) for Esti- 
mate Process Parameters Procedure: 

In one embodiment of the instant invention, the estimate 35 
process parameters procedure 64 uses a multivariate state 
estimation technique (MSET) procedure having an MSET 
parameter estimation model 52 structure. The US Depart- 
ment of Energy’s Argonne National Laboratory originally 
developed the implementation of MSET described herein for 40 
surveillance of sensors and components in nuclear power 
plant applications. However, other implementations of a 
multivariate state estimation technique are possible and 
useful in conjunction with the instant invention. MSET is in 
general a statistically derived parameter estimation algo- 45 
rithm that uses advanced pattern recognition techniques to 
measure the similarity or overlap between signals within a 
defined domain of asset operation (set of asset operating 
states). MSET “learns” patterns among the signals by 
numerical analysis of historical asset operating data. These 50 
learned patterns or relationships among the signals are then 
used to identify the learned state that most closely corre- 
sponds with a new signal data observation. By quantifying 
the relationship between the current and learned states, 
MSET estimates the current expected response of the asset 55 
signals. MSET then uses threshold comparisons or a form of 
statistical hypothesis testing, such as a Sequential Probabil- 
ity Ratio Test (SPRT) as disclosed inU.S. Pat. No. 5,459,675 
which is hereby incorporated by reference in its entirety or 
the Adaptive Sequential Probability (ASP) procedure that is 60 
a hallmark of the instant invention to compare the current 
estimated value of a signal with its observed value. The 
MSET procedure provides an accurate and widely appli- 
cable method to estimate the operating signal values for an 
asset. However, other implementations of the parameter 65 
estimation procedure are possible and useful in conjunction 
with the instant invention. 


An MSET model is created for the asset 12 using the 
MSET training algorithms to learn the inherent data rela- 
tionships within a set of historical asset operating data. The 
trained MSET model is then used with the MSET parameter 
estimation and fault detection algorithms to perform the 
process surveillance function when presented with a new 
observation of signal data values. The following sections 
will first provide a mathematical overview of the MSET 
algorithms and procedures useiul for training a parameter 
estimation model and for using this trained model for 
process surveillance. The description is followed by a 
detailed description of a preferred embodiment of the instant 
invention using a novel Adaptive Sequential Probability 
(ASP) procedure for fault detection during process surveil- 
lance. 

MSET Training and Parameter Estimation Procedures: 

The MSET methods are generally described in the fol- 
lowing two US Government documents produced and main- 
tained by the US Department of Energy’s Argonne National 
Laboratory, Argonne, 111., disclosure of which is incorpo- 
rated in its entirety herein by reference. 

J. P. Herzog, S. W. Wegerich, R. M. Singer, and K. C. 
Gross, “Theoretical Basis of the Multivariate State Estima- 
tion Technique (MSET),” Argonne National Laboratory, 
ANL -NT-49, December 1997. 

J. P. Herzog, S. W. Wegerich, K. C. Gross, and R. M. 
Singer, “MSET: Code Structure and Interface Development 
Guide,” ANL-NT-48, August 1997. 

The MSET algorithm uses pattern recognition with his- 
torical operating data from an asset to generate a parameter 
estimation model. If data is collected from a process over a 
range of operating states, this data can be arranged in matrix 
form, where each column vector (a total of m) in the matrix 
represents the measurements made at a particular state. 
Thus, this matrix will have the number of columns equal to 
the number of states at which observations were made and 
the number of rows equal to the number of measurements (a 
total of n signal data values) that were available at each 
observation. We begin by defining the set of measurements 
taken at a given time Xj as an observation vector X(t 7 ), 

X(tj)=[x iO tj , ), x 2 (tj), . . . , x n (tj)} T (El) 

where x.(t y .) is the measurement from signal i at time t^. We 
then define the data collection matrix as the process memory 
matrix D: 


lAl ^ 1,2 ■■■ ,m 

| ^2,1 A 2 ■■■ d 2 jn 


= [*(?!), *(r 2 ), ... 


d n ,l <4,2 


(E2) 


Each of the column vectors (X(1p) in the process memory 
matrix represents an operating state of the asset. Any number 
of observation vectors can be assigned to the process 
memory matrix. Training an MSET model includes collect- 
ing enough unique observation vectors from historical 
operation of the asset during normal conditions such that the 
process memory matrix encompasses the hill dynamic oper- 
ating range of the asset. Computation of the D matrix is the 
first of three steps in the method for training an MSET model 
based on historical operating data. 

One of at least two algorithms is used by MSET to select 
the vectors in the D matrix. The MinMax algorithm extracts 
vectors that bound the vector space defined by the training 
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data and returns the smallest process memory matrix that 
will produce an effective system model (see also U.S. Pat. 
No. 5,764,509 which is hereby incorporated by reference in 
its entirety). A vector ordering algorithm selects and 
includes representative vectors from the inner regions of the 5 
vector space producing a more accurate system model. 

Once the process memory matrix has been constructed, 
MSET is used to model the dynamic behavior of the system. 

For each current observation of the system (Xobs), MSET 
compares the observation vector to the stored operating to 
states to calculate an estimate of the process parameter 
values. The parameter estimate of the current process state 
(Xest) is an n-element vector that is given by the product of 
the process memory matrix and a weight vector, W: 


x est = DW 


(E3) 


The weight vector represents a measure of similarity 20 
between the estimate of the current state and the process 
memory matrix. To obtain the weight vector, we minimize 
the error vector, R, where: 

R=x ob -x est (E4) 25 

The error is minimized for a given state when: 


— » <-.7 _ -1 

W = (D ® 0 ) 


\D X a 


(E5) 

30 


This equation represents a “least squares” minimization 
when the pattern recognition operator <g,is the matrix dot 
product. Several advanced pattern recognition operators 35 
have been defined that provide excellent parameter estima- 
tion performance. Pattern recognition operators used by 
MSET include, but are not limited to, the System State 
Analyzer (SSA) method (see also U.S. Pat. No. 4,937,763 
which is hereby incorporated by reference in its entirety), the 4Q 
Bounded Angle Ratio Test (BART) method (see also U.S. 
Pat. No. 5,987,399 which is hereby incorporated by refer- 
ence in its entirety), the Vector Pattern Recognizer (VPR) 
method, the Vector Similarity Evaluation Technique (VSET) 
method, and the Probabilistic State Estimation Method 
(PSEM). 

Once the weight vector is found, the resulting current state 
estimate of the system (i.e., the parameter estimate vector) 
is given by: 


X est = D-(D®D) - 


(E6) 


The first application of the pattern recognition operator in 55 
equation E6 (D r <gl)) involves a comparison between the row 
vectors in the D T matrix and each of the column vectors in 
the D matrix. If we define G=D r <g)D, then G, the similarity 
matrix, is an m by m matrix. The element in the i-th row and 
j-th column of the matrix (g zV ) represents a measure of the 60 
similarity between the i-th and j-th column vectors (i.e., 
memorized states) in the process memory matrix. The sec- 
ond application of the pattern recognition operator in equa- 
tion E6 (D r (g)X 0 ^) involves a comparison between the row 
vectors in the D r matrix and each of the elements in the 65 
observation vector Xobs. If we define A=D T <g > X obs , then A, 
the similarity vector, is an m by 1 vector. Each element in the 


similarity vector is a measure of the similarity between the 
observation vector and the i-th column vector (i.e., memo- 
rized state) in the process memory matrix. 

Note that the similarity matrix is a function of the process 
memory matrix only. Thus, the similarity matrix and its 
inverse Ginv=(D r g)D) -1 can be calculated as soon as the 
process memory matrix has been derived thereby making the 
application of MSET to an on-line surveillance system more 
computationally efficient. Computation of the Ginv matrix 
initializes the parameter estimation model and completes the 
second of three steps in the procedure for training an MSET 
model based on historical operating data. 

The third and final step in the MSET training procedure 
includes analyzing the historical training data using equation 
E4 to produce a residual error vector, R, for each observation 
vector in the training data. The collection of residual error 
vectors comprises the training data residuals necessary for 
training the fault detection model 54 using any one of a 
plurality of techniques, including but not limited to the 
SPRT technique, and the novel ASP technique that is a 
hallmark of the instant invention. 

The Sequential Probability Ratio Test (SPRT) technique is 
a statistical hypothesis test fault detection algorithm histori- 
cally used for MSET process surveillance. The SPRT tech- 
nique is described in U.S. Pat. No. 5,459,675, which is 
incorporated herein by reference in its entirety. The SPRT 
analyzes a sequence of discrete residual error values from a 
signal to determine whether the sequence is consistent with 
normal signal behavior or with some other abnormal behav- 
ior. When the SPRT reaches a decision about the current 
signal behavior, e.g., that the signal is behaving normally or 
abnormally, the decision is reported and the test continues 
analyzing the signal data. For any SPRT, signal behavior is 
defined to be normal when the signal data adheres to a 
Gaussian probability density function (PDF) with mean 0 
and variance a 2 . Normal signal behavior is referred to as the 
null hypothesis, H 0 . MSET employs four specific SPRT 
hypothesis tests. Each test determines whether current signal 
behavior is consistent with the null hypothesis or one of four 
alternative hypotheses. The four tests are known as the 
positive mean test, the negative mean test, the nominal 
variance test, and the inverse variance test. For the positive 
mean test, the corresponding alternative hypothesis, H l5 is 
that the signal data adhere to a Gaussian PDF with mean+M 
and variance a 2 . For the negative mean test, the correspond- 
ing alternative hypothesis, H 2 , is that the signal data adheres 
to a Gaussian PDF with mean-M and variance a 2 . For the 
nominal variance test, the corresponding alternative hypoth- 
esis, H 3 , is that the signal data adheres to a Gaussian PDF 
with mean 0 and variance Vcr 2 . For the inverse variance test, 
the corresponding alternative hypothesis, H 4 , is that the 
signal data adheres to a Gaussian PDF with mean 0 and 
variance c^/V. The user-assigned constants M and V control 
the sensitivity of the tests. 

Limitations of the SPRT Fault Detector Training and 
Surveillance Method and System: 

One significant shortcoming of the SPRT technique is 
found in the assumptions underlying its mathematical for- 
mulation. Specifically, the SPRT technique presumes that 
the residual error signals adhere to a Gaussian probability 
density function. For residual error signals that are non- 
Gaussian, the fault detector false alarm rates and/or missed 
alarm rates specified by the designer are not accomplished 
by the SPRT procedure thereby degrading the fault decision 
accuracy of the asset control and/or surveillance system. The 
novel ASP technique of the instant invention specifically 
removes the assumption that the residual error signals 
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adhere to a Gaussian probability density function. The ASP 
technique implements any one of a plurality of methods to 
numerically fit a probability density function to the observed 
residual error signal distribution that is characteristic of 
normal asset operation. The derived probability density 3 
function is then used to perform a dynamic statistical 
hypothesis test thereby achieving the fault detector false 
alarm and missed alarm rates specified by the designer and 
improving the fault decision accuracy of the asset control 
and/or surveillance system. 10 

Fault Detection Using Statistical Hypothesis Test Proce- 
dures: 


the probability of two events occurring jointly can be 
expressed as a function of the conditional probability of one 
event based on the other, 

P{R, S)=P(R\S)P(R) (E9) 

If R and S are independent events, then P(RIS)=P(R) and 
the multiplication law simplifies to P(R, S)=P(R) P(S). 

Bayes’ rule can be simplified by eliminating the denomi- 
nator in equation E7. Combining equations E8 and E9, and 
substituting e for R and H for S, the denominator can be 
written as 


The general theory underlying the statistical hypothesis 
test will now be delineated below. Next, the SPRT imple- 
mentation of a dynamic statistical hypothesis test will be 
described. Finally, the novel ASP implementation of a 
dynamic statistical hypothesis test for non-Gaussian residual 
error signals will be delineated in detail along with a 
delineation of its reduction to practice. 20 

Bayes’ Rule for a Single Observation: 

Statistical decision problems in which there are just two 
possible outcomes constitute an important class called 
binary hypothesis testing problems. The possible states of a 
system are called hypotheses and each individual state of the 25 
system is termed a simple hypothesis. A simple hypothesis 
is a complete specification of a probability distribution for 
the system (i.e., the distribution of possible observations or 
samples from the system). The “hypothesis” being tested is 
that the particular distribution is the correct one. 30 

The basic operation in a binary hypothesis test is to 
evaluate the veracity of a hypothesis, H, given a piece of 
evidence or observation, e, from the system being studied. 
Because of the unpredictability or element of chance inher- 
ent in the system, the test deals with the probabilities that 35 
events occur or that hypotheses are true. The probability that 
a hypothesis is true given a piece of evidence is written as 
P(Hle). The notation identifies a conditional probability — 
namely the probability that the hypothesis is true under the 
condition that the event has occurred with absolute certainty. 40 
Bayes’ well known inversion rule, as described in Probabi- 
listic Reasoning in Intelligent Systems: Networks of Plau- 
sible Inference, by Judea Pearl, Morgan Kaufmann Publish- 
ers, Inc., San Mateo, Calif., 1988, Second Edition at pages 
29 through 39, provides a means for evaluating the condi- 45 
tional probability of a hypothesis, P(Hle), which is also 
known as the posterior probability, 


P(e | H)P{H) (E7) 50 


P(elH) is the probability that the observation would occur 
if the hypothesis is true. P(H) is the probability that the 55 
hypothesis is true before any observations of the system 
have been made, also known as the prior probability. The 
denominator, P(e), is the unconditional probability that the 
observation occurs. 

Basic axioms of set theory can be used to prove the 60 
following identity for two events R and S: R=(RnS)U 
(RPl - lS), where — iS is the converse of event S. In prob- 
ability theory, the analog of this identity is 

P(R)=P(R,S)+P(R, “IS), (E8) 

65 

where the notation P(R,S) is used represent the probabil- 
ity of the joint event RHS. The multiplication law states that 


P(e)=P(e\H)P(H)+P(e\~~iH)PC~iH) (E10) 

Therefore, Bayes’ rule becomes 


W\e) = 


J\e\H)P{H) 


P(e | H)P(H) + P(e\ 


1 H)P{ 


1 H) 


(Ell) 


The power of Bayesian techniques comes primarily from 
the fact that in causal reasoning the relationship P(elH) is 
local, namely, given that H is true, the probability of e can 
be estimated naturally and is not dependent on many other 
propositions. For instance, given that the measurements 
from an experiment adhere to a particular PDF, the prob- 
ability that any single measurement will occur is easily 
computed. 

The complementary form of Bayes’ rule provides the 
posterior probability for the converse of the hypothesis. It is 
evaluated by substituting — iH for each instance of H in 
equation Ell and noting that — 1( — iH)=H: ] 


PC 


1 H\e) = 


P(e\- 


1 H)PC 


1 H) 


P{e\H)P(H) + P(e\ m 


1 H)PC 


w 


(El 2) 


An alternate form of Bayes’ rule is produced by dividing 
it (i.e., equation Ell) with its complementary form (i.e., 
equation El 2) to obtain 


P{H\e) 


P{e\H)P{H) 


PC 


We) P(ef 




(El 3) 


This form of Bayes’ rule is further manipulated by first 
defining the prior odds on hypothesis H as 


0{H) = 


pm 

Pi \H)' 


the likelihood ratio as 


Ue\H) -- 


P(e\H) 

P(e I I //)’ 


(EI 4 ) 


(El 5) 
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and the posterior odds on H as 


0(H\e) = 


P( 


PWe) 
1 H\e ) ' 


20 

random), then the probabilities will be dependent on the 
hypothesis only. In this case the conditional probability for 
the ith observation, P(y z ly z _i, . . . , y l5 H), is simply P(y z IH). 
(El 6) Thus, Bayes’ rule for independent observations in a time 

5 series is 


Bayes’ rule then specifies the posterior odds as the 
product of the likelihood ratio and the prior odds, 

0(H\e)=L{e\H)0(H). (E17) 10 


PiH\ yi ,y 2 ,... ,y n )= (E22) 

P(y n \H) ... P( yi \H)P(H) = A P{yj\H) 

P(y n ) ■■■ P(yi) P(yi) 


Bayes’ Rule for a Time Series: 

The formulation above specifies Bayes’ rule for a single 
observation and a binary hypothesis. For the application of 
binary hypothesis tests to real world signals, the formulation 
must be able to handle a sequence of discrete observations. 
This is accomplished by beginning with a single observation 
and successively updating Bayes’ rule for each successive 
observation. Let the sequence {Y„} be an ordered set of n 
elements, {Y„}=y l5 y 2 , . . . , y M , in which the elements are 
observations of the signal made at n discrete moments in 
time such that t x <t 2 < . . . <t„. Bayes’ rule for the first 
observation (y : ) in the time series is 


P(H\yi) = 


P( yi \H)P(H) 
P(yi ) 


(El 8 ) 


If an explicit time dependency can be established for the 
observations in the time series (i.e., a function is found that 
15 relates earlier events to later events), then the general form 
of Bayes’ rule (equation E21) can be used to develop failure 
detection models for serially-correlated signals. However, 
the residual signals formed by the difference between the 
observed and estimated signal values are in general random 
20 signals; thus, Bayes’ rule for random time series is used as 
the basis for the fault detection models. 

Dividing by the complementary form of Bayes’ rule for 
random time series and utilizing the definition of the pos- 
terior odds, prior odds, and likelihood ratio from above, an 
25 alternate form of Bayes’ rule for a time series is developed: 


FT (E23) 

Adding the second observation from the time series, — , y n ) - °(H )[ } Uy,\ H )- 

Bayes’ rule for the joint event y^y-j is 


x P( yi ,y 2 \H)P(H) (El 9) 

P(H\yi,y 2 ) = - . 

P(yi,y2) 

With the aid of the multiplication law (equation E9), the 
joint event probabilities are converted to conditional prob- 
abilities so that the right hand side of this equation can be 
rewritten as 


0^1 x P(y 2 \yi,H)P( yi \H)P(H) (E20) 

45 

Note that the probability of the joint event y 1 C\y 2 is 
written as P(y l5 y 2 )=P(y 2 lyi) P(yi) instead of the equivalent 
p(yi, y 2 )=P(yily 2 ) P(Y 2 ) because of the temporal depen- 
dency of the data. The second form of the multiplication law 50 
reduces to P(y l5 y 2 ) = P(yi) P(y 2 ) because earlier events (e.g., 
y x ) in a time series cannot be dependent on later events (e.g., 

y 2 )- 

The multiplication law is used for each successive obser- 
vation in the time series to derive the form of Bayes’ rule for 55 
the joint event (yiriy 2 n . . . Hy„): 

nmyuyi,- ,yn)= (E 21 ) 

Pjyn\yn-u - .> 1 . H) ... H)P(yi\H)P(H) 60 

J’O'nb'n-i. ... ,y0 ... /’teLyO/’Cyi) 


If we take the logarithm of this equation, the incremental 
nature of Bayesian formulation becomes more apparent. 
Equation E24 shows the log of the likelihood ratio as a 
weight, carried by each observation in the sequence, which 
additively sways the belief in the hypothesis one way or the 
other. 


A (E24) 

HO(H\yu > 2 , - , y n )] = Worn + £ MUyiW] 


Sequential Hypothesis Tests: 

Wald first presented and studied the following sequential 
test of a simple hypothesis against a simple alternative, as 
described in Sequential Analysis, by A. Wald, John Wiley 
and Sons, Inc., New York, 1947. Let H 0 be a specific 
probability density function called the null hypothesis. Then 
the probability that the time series {Y„} contains samples 
drawn from H 0 is P(y l5 y 2 , . . . , y w IH 0 ). Let H be a different 
probability density function called the alternative hypoth- 
esis. Then the probability that the time series {Y„} contains 
samples drawn from H^ is P(y l5 y 2 , . . . , yJH^). Two 
threshold limits A and B are chosen, with A<B, and after 
each observation in the series the following statistic (A^) is 
calculated: 


_ PO' 1 , 3 ' 2 ,- ,y n \Hj) (E25) 

iJ * P(yi,y 2 ,... , y n \H 0 ) 


Since the probabilities for each of the later observations in 
the time series are conditioned on earlier observations, 65 
Bayes’ rule is difficult to solve in the general case. But if the 
observations in the series are independent of each other (i.e., 


The test procedure is then as follows. If the statistic is 
greater than or equal to the upper threshold limit (i.e., 
A^i^B), then a decision is made to accept hypothesis H^. as 
true. If the statistic is less than or equal to the lower 
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threshold limit (i.e., A^^A), then a decision is made to 
accept hypothesis H 0 as true. If the statistic falls between the 
two limits (i.e., A<A 7> <B), then neither hypothesis can yet 
be accepted to be true and sampling continues. If the 
observations in the series are independent random variables, 
then the test statistic reduces to a product of conditional 
probabilities: 


J ’ n y P(yi\H 0 y 


(E26) 


while the posterior odds are defined as 


5 


Oj(H 0 \e) = 


P(Hj\e) 
P(Ho I e ) ' 


(E 30) 


Assuming the observations in the time series are inde- 
pendent, then Bayes’ rule conditioned on hypothesis H . (i.e., 
equation E22 with the symbol H replaced by H,-) can be 
divided by Bayes’ rule conditioned on hypothesis H 0 to 
produce Bayes’ rule for a sequential hypothesis test: 


In the sequential hypothesis tests, the logarithm of the test 
statistic is often easier to work with: 


n 

Oj(H 0 \ yi , y 2 , ... , y n ) = Oj(H 0 )Y\ Lj( yi \ H 0 ). 


(E 31) 


In A = 



PiyAHj) 
P( yi \H 0 ) 


(E27) 
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The sequential hypothesis test consists of calculating the 
logarithm of the test statistic for each observation in the 
series and comparing the result to the logarithms of the 25 
lower and upper threshold limits. 

The statistic in the sequential hypothesis test is a product 
of a sequence of likelihood ratios. Each term in the product 
is the ratio of a probability conditioned on one hypothesis to 
a probability conditioned on a second hypothesis. The 
difierence between the likelihood ratios in the sequential 
hypothesis tests and those in Bayes’ rule (see equations El 5 
and E23), is that in Bayes’ rule the probabilities are condi- 
tioned on a hypothesis and its converse, whereas in the 35 
sequential hypothesis tests the probabilities are conditioned 
on two hypotheses from a set of related hypotheses. In 
principle, the two hypotheses in the sequential hypothesis 
tests could be the converse of each other (i.e., the set 
contains two elements), but in practice the hypotheses are 40 
selected from an infinite set of exhaustive and mutually 
exclusive hypotheses. For instance, suppose the null hypoth- 
esis is a Gaussian PDF with a mean of 1 and a variance 10. 
Then a sequential hypothesis test can be defined in which the 
alternate hypothesis is any Gaussian PDF in which the mean 45 
is not 1 and/or the variance is not 10. 

It is informative to compare the sequential hypothesis 
tests to Bayes’ rule for a time series. Given a null hypothesis 
H 0 and an alternative hypothesis H y , the likelihood ratio for 5Q 
an observation e conditioned on the two hypotheses is 
defined as 


Lj(e\H 0 ) = 


P{e\Hj) 

P(e\Ho)' 


(E 28) 55 


The subscripts emphasize the fact that the two hypotheses 
are selected from an infinite set of related hypotheses. The 
prior odds for the two hypotheses are defined as 


Oj(H 0 ) ■■ 


P{Hj) 

P{HoY 


(E 29) 
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Dividing this equation through by the prior odds, it 
becomes apparent that the statistic in the sequential hypoth- 
esis test is just the ratio of the posterior odds to the prior 
odds: 


Oj(H 0 | yi , y 2 , ... , y n ) 
Oj(Ho) 


H Lj(y, | H 0 ) = A,-., 


(E 32) 


The prior odds are the ratio of the probability that the 
alternative hypothesis is true to the probability that the null 
hypothesis is true, before any data have been collected from 
the system. In many cases, these probabilities are not 
known — no prior information about the system is known. In 
the absence of prior information about the system, these 
probabilities are taken to be Vi, making the prior odds equal 
to 1. Thus in the absence of prior information, the test 
statistic A y „ is equal to the odds that the system adheres to 
the alternative PDF as opposed to the null PDF. 

Error Probabilities for the Sequential Hypothesis Tests: 

Because the sequential hypothesis tests make decisions 
based on probabilities, there is always a finite probability 
that a decision reached by the test is erroneous. If a sequen- 
tial hypothesis test makes a decision to accept the alternative 
hypothesis as true when the null hypothesis H 0 is true, 
then an error of type I is said to have occurred. If a sequential 
hypothesis test makes a decision to accept the null hypoth- 
esis H 0 as true when the alternative hypothesis is true, 
then an error of type II is said to have occurred. Although the 
designation is arbitrary, it stems from situations in which one 
kind of error is more serious than the other. Since the normal 
usage of the sequential hypothesis tests is to detect a change 
in signal response from its normal behavior (i.e., hypothesis 
H 0 ) to some abnormal behavior (i.e., hypothesis H^), the 
error of accepting H, when H 0 is true is the more serious 
error. The probability that a decision to accept hypothesis H, 
is erroneous is denoted by a. A type I decision error is also 
called a false alarm and the probability of a type I error is 
called the false alarm probability. The probability that a 
decision to accept hypothesis H 0 is erroneous is denoted by 
(3. A type II decision error is also called a missed alarm and 
the probability of a type II error is called the missed alarm 
probability. 

The sequential hypothesis tests are open-ended. The tests 
will continue to collect observations from the system and 
update the test statistic until the test statistic satisfies one of 
the two decision conditions. In principle, the number of 
observations needed to reach a decision can be any positive 
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integer, although it can be shown that a decision will be 
reached in a finite number of observations. Since the number 
of observations needed to make a decision is indeterminate, 
the probability that a decision is erroneous is found by 
summing the probability of an erroneous decision being 5 
made after 1 observation, 2 observations, and so on. For- 
mally, in terms of the threshold limits A and B that define the 
test, the false alarm probability is given by: 

a=P(Aj^B\H 0 )+P(B<A ja <A, A ja ^B\H 0 )+ . . . (E33) 

The first term in the sum is the probability that the test 
statistic drops below the lower threshold limit after only one 
observation given that the alternative hypothesis is true. The 
second term is the probability that the test statistic drops 
below the lower threshold limit after two observations given 15 
that the alternative hypothesis is true. Similarly, the missed 
alarm probability is given by: 

$=P(A Jyl ^A\Hj)+P(B<A ja <A, Aj2 =A\Hj)+ . . . (E34) 

These expressions are by no means easily computed. 2 o 
Moreover, one could not hope to solve these equations for A 
and B in terms of given a and (3, despite the desirability of 
being able to do so in setting up a test to provide a specified 
protection. Although these equations cannot be solved, it can 
be shown that the error probabilities and the threshold limits 25 
are related by the following inequalities 


a 1 ft (E 35) 

A > and — > . 

(1 - P) B (1 -or) 30 

The error probabilities and the threshold limits are related 
by inequalities because the test statistic A Jn does not usually 
attain exactly the value A or the value B when the test is 
completed. But since a decision is declared as soon as an 
observation drives the test statistic past either threshold, the 
inequalities are almost equalities. Indeed, in practice, A and 
B are taken to be equal to a/(l-|3) and (l-a)/|3, respectively. 
Doing so, of course, means that the sequential hypothesis 
test actually carried out has error probabilities that are 
somewhat different than those specified. Let a’ and (3* denote 
the empirical error probabilities actually attained by a test 
using specified threshold limits of A=a/(l-(3) and B=(l-a)/ 

(3. Then according to the inequalities in equation E35, the 
empirical error probabilities (i.e., a' and (3') are related to the 
preassigned error probabilities (i.e., the values of a and (3 
used to specify the threshold limits) by 


a' a {¥1/3 

< A = and < — = . 

(1-jSO (1 - ft) (1-arO B (1 -or) 


(E 36) 
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Multiplying these through to eliminate denominators, one 
obtains 55 

a'-a'p^a-afJ 1 and (E37) 

Adding these two equations together, one obtains an 
inequality relating the empirical error probabilities to the 
preassigned error probabilities: 60 

a’+P’^a+p. (E38) 

One of the key features of the hypothesis test technique is 
that the designer can specify the error probabilities. This is 
particularly important for type I errors, because false alarms 65 
can cause an operator to make an incorrect decision. Type II 
errors typically do not lead to incorrect decisions. This is 
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because in the event that a real failure does occur, missed 
alarms may delay the time to detect the failure but not the 
ability to detect the failure. The result above (equation E38) 
shows that the preassigned false alarm probability is not a 
strict upper limit for the empirical false alarm probability. 
Similarly, the preassigned missed alarm probability is not a 
strict upper limit for the empirical missed alarm probability, 
even when the hypothesis test is applied to purely random 
data. It is the sum of the preassigned error probabilities that 
is an upper limit for the sum of the empirical error prob- 
abilities. Thus it is possible with purely random data for one 
of the empirical error probabilities to exceed its correspond- 
ing preassigned error probability, but both empirical error 
probabilities cannot be greater than their corresponding 
preassigned error probabilities. 

True upper bounds for the empirical error probabilities 
can be determined from the inequalities in equation E36, 


, oc a 

a s (T7so s (TUs) 


and ft < 


P „ P 

(!-<*') " (l-orT 


(E 39) 


For small preassigned error probabilities, the true upper 
bounds are only slightly greater than the preassigned error 
probabilities. For instance, if both of the preassigned error 
probabilities are 0.01 then the empirical error probabilities 
of the test will not exceed 0.0101. 

The Sequential Probability Ratio Test (SPRT): 

The sequential hypothesis tests described herein above are 
general statistical tests valid for any pair of related hypoth- 
eses. MSET has historically employed four specific sequen- 
tial hypothesis tests to detect signal faults. These four tests 
are called the Sequential Probability Ratio Tests, or SPRTs. 
The SPRTs monitor for changes in the statistical character- 
istics of the residual signals. A residual signal is the differ- 
ence between an actual signal and MSET’s estimate of that 
signal. The SPRTs continually monitor a residual signal, 
generating sequences of decisions. A decision in which the 
null hypothesis is accepted (i.e., A Jn ^A) is called a normal 
decision and implies that the residual signal is behaving as 
anticipated. A decision in which the alternative hypothesis is 
accepted (i.e., A J n ^B) is called a fault decision and implies 
that the residual signal is behaving abnormally. 

The null hypothesis upon which the SPRTs are based 
specifies that the residual signal consists of Gaussian data 
that have a sample mean of 0 and a sample variance of cr 2 . 
A training procedure, during which the system is operating 
normally, is used to verily that the mean of the signal is 0 and 
to evaluate the variance of the signal. Note that if the 
residual signal does not have a mean of 0, the calculated 
mean from the training phase is used to normalize the 
residual signal for the surveillance procedure using the 
model. Thus the null hypothesis, H 0 , for the SPRTs is that 
the signal being analyzed adheres to Gaussian PDF (N(y; p, 

a 2 )), 


N(y\ fi, cr 2 ) = 


-\pln 


exp 


2 cr 2 


(y-pY 


(E 40) 


for which the mean (p) is 0 and the variance is a 2 . Thus, 
the conditional probability that a discrete observation (y f ) 
occurs given the null hypothesis is expressed as 


Piy^Niys 0, a 2 ) 


(E41) 
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The four SPRT tests historically used with MSET are the 
positive mean test, the negative mean test, the nominal 
variance test, and the inverse variance test. For the positive 
mean test, the corresponding alternative hypothesis, H l5 is 
that the signal data adhere to a Gaussian PDF with mean+M 5 
and variance a 2 , where M is the preassigned system distur- 
bance magnitude for the mean test. For the negative mean 
test, the corresponding alternative hypothesis, H 2 , is that the 
signal data adheres to a Gaussian PDF with mean-M and 
variance a 2 . Referring to FIG. 10, the hypotheses for the two to 
mean tests are illustrated for M=2 and a 2 =l . For the nominal 
variance test, the alternative hypothesis, H 3 , is that the signal 
data adheres to a Gaussian PDF with mean 0 and variance 
Va 2 , where V is the preassigned system disturbance mag- 
nitude for the variance test. For the inverse variance test, the 15 
corresponding alternative hypothesis, H 4 , is that the signal 
data adheres to a Gaussian PDF with mean 0 and variance 
a 2 /V. Referring to FIG. 11, the hypotheses for the two 
variance tests are illustrated for V=2 and cr 2 =l. The condi- 
tional probabilities that a discrete observation (y z ) occurs 20 
given one of the four alternative hypotheses are expressed as 


The SPRT index for the nominal variance test (SPRT wom ) 
is given by 


InA 3 ,„ = 
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The SPRT index for the inverse variance test (SPRT ZKV ) is 
given by 


P(y> I HO = N(yr, M, cr 2 ), 
P(y> I Hi) = N(ya -M, (T 2 ), 
P(yi \H 3 )= N(yi w , 0, Vcr 2 ), 

P(y; | H 4 ) = 0, j. 


(E 42) 
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The logarithm of the test statistic for the four SPRT tests 
can be evaluated by substituting the conditional probabilities 
from equations E41 and E42 into the general formula 
(equation E27) and simplifying. Thus, for the positive mean 
test, 


lnAi ; „ 


Z \N(yr, M,^)' 
ln N(yr, 0, cr 2 ) 




Z yi 2 -( yi -M) 2 
2cr 2 



-j = SPRT pos . 
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The equation E43 defines the SPRT index for the positive 
mean test (SPRT^J. A SPRT index is defined for each of the 
four sequential hypothesis tests. The SPRT indices are the 
actual quantities computed by the SPRT fault detection 55 
procedure. The SPRT index for the negative mean test 
(SPRT„ cg ) is given by 




Z r -M, cr 2 )' _ 
N(yr,0, cr 2 ) 
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Novel Improvements to the Fault Detector Training and 
Surveillance Procedures: 

Having described the SPRT fault detector training and 
surveillance methods herein above, the section below 
describes the novel improvements made by the instant 
invention when used for fault detector training and surveil- 
lance, said improvements being applicable to any asset 
preferably having at least one characteristic data signal. 
The Adaptive Sequential Probability (ASP) Test: 

The SPRT technique is valid only for residual signals that 
are random and adhere to Gaussian distributions. Because 
the MSET parameter estimation algorithm simulates the 
deterministic components of a set of inter-related raw sig- 
nals, the residual signals that are produced within MSET are 
often white-noise processes (i.e., not serially-correlated). 
Although the MSET residual signals are typically white, 
they are often non-Gaussian (not normally distributed). If 
the SPRT fault detection model is applied to signals that are 
either non-Gaussian, non- white, or both, the theoretical false 
and missed alarm probabilities may not be achieved. The 
novel advanced fault detection method of the instant inven- 
tion, herein named the Adaptive Sequential Probability 
(ASP) technique or method, solves the problem associated 
with non-Gaussian, non- white residual signals. The ASP 
method uses binary hypothesis tests that are numerically 
tuned to better accommodate the non-Gaussian data distri- 
butions that are typical for residual error signals produced by 
parameter estimation based techniques, such as MSET. 
Mathematical Foundations of the ASP Test: 

The Adaptive Sequential Probability (ASP) method 
defines four new sequential hypothesis tests. The Adaptive 
Sequential Probability method is an advanced failure detec- 
tion technique that broadens the domain of applicability of 
the SPRT technique to non-Gaussian PDFs. In the ASP 
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method, the assumption that the data fit a Gaussian PDF is 
relaxed and the test statistic is evaluated for any arbitrary 
data distribution. In the ASP method, the signal is assumed 
to consist of random observations that adhere to a specific 
PDF that is a function of the sample mean, variance, and 5 
possibly higher order terms. The PDF is denoted by the 
general function 3(y; g, a 2 , . . . ). The parameter list of the 
function is open-ended to indicate that additional terms, 
such as the sample skewness, kurtosis, or width of the 10 
distribution at half-maximum, may be required to charac- 
terize the function. 

The null hypothesis upon which the ASP tests are based 
specifies that the data distribution has a sample mean of 0 
and a sample variance of cr 2 . A training phase, during which 
the system is operating normally, is used to verify that the 
mean of the signal is 0 and to evaluate the sample variance. 

If the PDF is dependent upon any other additional terms, 
they are also evaluated during the training phase for numeri- 20 
cally tuning one or more probability functions obtained by 
fitting an open-ended general probability functions to one or 
more data distributions obtained during typical or normal 
operating conditions of the asset under surveillance. Thus, 
the conditional probability that a discrete observation (y z ) 25 
occurs given the null hypothesis is expressed as 

P(K i Lff 0 )=3(F I .; 0, a 2 , ... ) (E47) 

Typically, four sequential hypothesis tests (i.e., a positive 3Q 
mean test, a negative mean test, a nominal variance test, and 
a inverse variance test) are utilized by the ASP method. For 
the positive mean test, the corresponding alternative hypoth- 
esis, H l5 is that the signal data adhere to the specified PDF 
with mean+M and variance cr 2 where M is a preassigned 35 
system disturbance magnitude for the mean test. For the 
negative mean test, the corresponding alternative hypoth- 
esis, H 2 , is that the signal data adheres to the specified PDF 
with mean-M and variance a 2 . For the nominal variance 

40 

test, the corresponding alternative hypothesis, H 3 , is that the 
signal data adhere to the specified PDF with mean 0 and 
variance Va 2 where V is a preassigned system disturbance 
magnitude for the variance test. For the inverse variance test, 
the corresponding alternative hypothesis, H 4 , is that the 45 
signal data adhere to the specified PDF with mean 0 and 
variance c^/V. The conditional probabilities that a discrete 
observation (y.) occurs given one of the four alternative 
hypotheses are expressed as 


P(y> 1 HO = 


P(y ; 1 h 2 ) = 


P(y ; 1 h 3 ) = 

3tyi;0, Va 2 , ...), 


( 2 \ 

P(yi 1 h 4 ) = 



The logarithm of the test statistic for the four ASP tests 
can be evaluated by substituting the conditional probabilities 
from equations E47 and E48 into the general formula 
(equation E27). The logarithm of the test statistic for a given 65 
test is defined to be the ASP index for that test. The ASP 
index for the positive mean test (ASP po J is given by 


in A 


-£■ 


cr 2 , •••) ' 

30',; 0 , a- 2 , ■■■) 


■ ASP DC 


(E49) 


The ASP index for the negative mean test (ASP we ^) is 
given by 


In h.2,n - 
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Zs(yr,-M,o- z , -) ' 
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(E50) 


The ASP index for the nominal variance test (ASP wom ) is 
given by 


In A 3 jfI = 



3(y.-;Q. vo 2 ,-) 

3 O';; 0, cr 2 , ••■) 


= ASPfiom. 


(E51) 


The ASP index for the inverse variance test (ASP ZWV ) is 
given by 


In A 4 „ = 


S' 




30',-; 0, v 2 , ■■■) 


(E52) 


The ASP tests are then implemented in the same manner 
as the SPRT tests. Namely, for each time step in the 
calculation, the four ASP indices are calculated (equations 
E49 through E52). Each ASP index is compared to the upper 
and lower log-thresholds and the status of the test is evalu- 
ated (i.e., faulted, normal, or continuing). 

ASP Method for Near-Gaussian Distributions: 

The Adaptive Sequential Probability method consists of 
four specific sequential hypothesis tests applied to non- 
Gaussian data distributions. In order to use the method, a 
general PDF, 3(y; g, cr 2 , . . . ), must be defined for the target 
signal. In this section, the ASP method is derived for data 
distributions that are nearly Gaussian. 

In applications of MSET to the Space Shuttle Main 
Engine accelerometer signals, applicants discovered that the 
residual signals produced by the system model have a nearly, 
but not truly, Gaussian behavior, as described in Dynamics 
Sensor Data Validation Phase I Final Report, by applicants: 
Randall L. Bickford and James P. Herzog, NASA Contract 
NAS8-40874, Jul. 1, 1997. When plotted as a histogram, the 
residual data will appear to have the same bell-curve shape 
as a Gaussian distribution. But when a Gaussian PDF of the 
same mean, standard deviation, and area as the data distri- 
bution is superimposed on it, the histogram was found by 
applicants to be non-Gaussian. Typically, applicants found 
that the histogram has thicker tails than the Gaussian curve, 
which corresponds to a sample kurtosis that is greater than 
3. 

A PDF can be written as a sum of a standard normal 
distribution, Z(x), and a remainder term, R(x), as described 
in Mathematical Methods of Statistics, by H. Cramer, Prin- 
ceton University Press, Princeton, N.J., 1946. 

3(jc)=Z(x)+i?(jt), (E53) 
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where the standard normal PDF 



is a function of the dimensionless variable x=(y-ja)/a. The 
standard normal PDF is related to the general Gaussian PDF 
(equation E40) through the normalization condition: 


He 6 (x)=x 6 - 1 5x 4 +45x 2 - 1 5 , 
i/e 7 (jc)^: 7 -21;t 5 +l 05x 3 - 1 05x, 

He 8 (x)=x 8 -2%x 6 +2 10x 4 -420x 2 +l 05 , 
He 9 (x)=x 9 -36x 7 +378x 5 - 1 260x 3 +945x, 
fTe lo (jc)=Jc lo -45x 8 +630x 6 -3 150x 4 44725x 2 -945, 
^ 11 (x)^c 11 -55a: 9 +990x 7 -6930x 5 +17325x 3 -10395x j 


Z(x)dx = J N(y; ju,a 2 )dy = 1. (E55) 

15 

The remainder term can be expressed as an infinite series 
of orthogonal polynomials whose terms are functions of 
derivatives of the standard normal PDF. The constant coef- 
ficients of the polynomial terms are dependent on the central 
moments of the target distribution. As described in Hand- 20 
book of Mathematical Functions, by M. Abramowitz and I. 

A. Stegun, Dover Publications, Inc., New York, 1972, the 
first four terms in the series expansion of the remainder 
function are: 


He l2 (x)=x 12 -66x 10 + 1 485x 8 - 138 60jc 6 +5 1 975.x 4 - 

62370x 2 +10395. (E59) 

Each term in the series expansion of the remainder 
function R(x) is more complex than the previous term as 
higher central moments of the data distribution and higher 
derivatives of the standard normal PDF are introduced. The 
nth term in the remainder function depends on the central 
moments from order 1 (i.e., mean) through order n+2. Thus, 
the first term depends on moments through the third central 
moment (i.e., skewness), the second term depends on 
moments through the fourth central moment (i.e., kurtosis), 
and so on. 


(E56) 


[i(^-15^-10(^f + 30)z<« W + (g 
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The p* factors are central moments of the discrete data 
sequence { Y„}=(y 1? y 2 , . . . , y„), 
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The Z^ n \x) functions are derivatives of the standard 
normal PDF. The nth derivative of the standard normal PDF 
is given by: 


50 


2 n \x) = ^-Z{x) = (-1 THe n {x)Z{x), 
dx n 


(E58) 
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where He„(x) are the Hermite polynomials. The first 
twelve Hermite polynomials are 

He 1 (x)=x, 

He 2 (x)=x 2 -1, 


If the data distribution that is to be approximated with 
equation E53 is nearly Gaussian, a good approximation will 
be achieved with only a few of the terms in the series. In the 
ASP method for near-Gaussian distributions, the data dis- 
tribution is first approximated with the standard normal PDF, 
Z(x). The approximation is refined by adding successively 
higher terms of the remainder function until an adequate fit 
between the general PDF, 2?(x), and the data distribution is 
produced. In practice it is usually not advisable to go beyond 
the fourth term in the remainder function because the tail 
regions of the general PDF become unstable. 

Using the equations for the derivatives of the standard 
normal PDF (equations E58 and E59), the remainder func- 
tion can be written as a product of the standard normal PDF 
and a polynomial whose coefficients depend on the central 
moments of the data distribution. 


r(x) = Z{x)Y j ri {x). 


(E60) 
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where the first four terms in the series are 




He 3 (x)=x 3 -3x, 
He 4 (x)=x 4 - 6x 2 +3 , 
He 5 (x)=x 5 - 1 0x 3 + 1 5x, 
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nW = 24(^4 ~ 3 ) H e 4 (x)+ ^(^3) He 6 (x), 

'' 3W = I5o(^- 10 ^Kw + 
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, , 1 ( 1*6 .-W , n //<3 1 2 . . (E 64 ) 1° 

'' 4W = 72o(^- 15 ^- 1 °b) +30 K W + 

m&fO-'Kn+inriZrw „ 


Thus, the series expansion formula for the approximation 
of the general PDF is 


^(*)*ZW + ZW^r/(jc). 


20 

(E65) 


Application of the ASP Method to Space Shuttle Main 
Engine Residual Signals: 

In one application use and operation, and in one preferred 
embodiment, the ASP technique was reduced to practice to 
provide highly reliable data validation for high frequency 30 
Space Shuttle Main Engine (SSME) accelerometers. Each 
SSME is monitored by eight high frequency (10.24 kHz) 
accelerometers with three sensors mounted on each of the 
two high pressure turbopumps and two sensors mounted on 
the engine gimbal bearing. Several MSET models were 35 
prepared to validate the ASP method for the six turbopump 
accelerometers. The MSET pattern recognition algorithm 
produces extremely accurate simulations of turbopump 
accelerometer behavior. Referring to FIG. 12, some results 
from applying the six sensor MSET model to a typical Space 40 
Shuttle flight (i.e., flight STS057, engine 1) are shown. For 
the calculation shown in FIG. 12, the MSET training algo- 
rithm extracted a small fraction of the data in the acceler- 
ometer signals to simulate the behavior of the accelerom- 
eters (the process memory matrix contained only 0.0083% 45 
of the raw signal data). In the top plot, the highly dynamic 
signal from the first SSME turbopump accelerometer in the 
model (i.e., the oxidizer prebumer pump 45 accelerometer) 
is shown as a function of time from the launch of the Space 
Shuttle. In the middle plot, MSET’s estimate of the accel- 50 
erometer signal shows the very high accuracy of the MSET 
simulation. The residual signal for the calculation, which is 
the difference between the sensor signal and MSET’s esti- 
mate of that signal, is shown in the bottom plot. The relative 
error in the calculation, as measured by the ratio of the 55 
standard deviation of the residual signal to the standard 
deviation of the sensor signal, is only 0.86%. The results 
shown in FIG. 12 illustrate MSET’s ability to accurately 
model highly dynamic signals, even when only a tiny 
fraction of available data is used to represent the signal 60 
behavior. 

Still referring to FIG. 12, the residual signal for the 
analysis of a typical SSME accelerometer appears to be 
random. Referring to FIG. 13, the distribution of data in the 
residual signal is shown. The probability density for the 65 
residual signal was approximated by dividing the data range 
into m equally-sized bins and then counting the number of 
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residual signal data values that fall into each bin. The count 
number is a discrete function of the sequence {Y m }=y 15 
y 2 , . . . , y m , where the ith element in the sequence (y f ) is 
given by 


. Cymax — ^min ) (E66) 

yi = 3 >min + l , 


and y min is the minimum datum and y max is the maximum 
datum in the residual signal. A normalized residual signal 
PDF is produced by dividing each element in the count 
number function by the total number of elements in the 
residual signal. The number of bins used to produce the 
residual signal PDFs was 1000. As shown in FIG. 13, a PDF 
for a typical residual signal has a nearly Gaussian shape. 
Superimposed on top of residual signal PDF is a Gaussian 
PDF of the same mean, standard deviation, and area as the 
residual signal PDF. The first four moments of the residual 
signal data are as follows: mean=l .1*1 CT 4 g, standard devia- 
tion=8.2*10 -2 g, skewness=0.32, and kurtosis=5.7, where 
the skewness and kurtosis are related to the third and fourth 
central moments of the data by 


skewness = ^4 and kurtosis = . 

cr 5 cr 4 


(E67) 


Because the kurtosis of the distribution is greater than 3, 
the residual signal PDF has thicker tail regions than a true 
Gaussian distribution. This is confirmed by a visual exami- 
nation of FIG. 13. 

The ASP method for near-Gaussian distributions was 
applied to the residual signals from the MSET model of the 
SSME accelerometers. The residual signal PDFs were 
approximated with the series expansion formula of a general 
PDF (equations E60 through E65). Four calculations were 
performed for each residual signal in the model. In the first 
calculation, the residual signals were approximated with the 
one-term series expansion formula. In the one-term formula, 
the remainder function (R(x)) is given by the product of the 
standard normal PDF and the first term in the series, r^x). 
Subsequent calculations introduced additional terms from 
the series, culminating with the four-term formula in which 
the remainder function is given by the product of the 
standard normal PDF and the four term series, r 1 (x)+r 2 (x)+ 
r 3 (x)+r 4 (x). 

For each calculation, the quality of the approximation is 
measured by the root mean- squared (rms) error of the 
calculation. The rms error (E rm J is a function of the differ- 
ence between the calculated PDF (3(x)) from equation E65 
and the residual signal PDF (F(x)), 


Erins = 


-f. □ (*■) - F(x i )f 


(E68) 


The calculated and residual signal PDFs are discrete 
functions of the dimensionless variable x z =(y z -|i)/a, where 
y z is an element in the sequence that spans the range of data 
in the residual signal distribution (see equation E66). 

In general terms, each additional term in the series expan- 
sion improves the approximation of the residual signal 
PDFs. Referring to FIG. 14, FIG. 15 and FIG. 16, adding 
terms from the series expansion formula improves the fit of 
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the residual signal PDF from the first sensor. The approxi- 
mation of the residual signal PDF generated by the one-term 
series expansion formula is compared to a Gaussian approxi- 
mation and the residual signal PDF in FIG. 14. The one-term 
approximation shows little improvement over the Gaussian 
approximation. The rms error for the Gaussian PDF is 0.414, 
while the rms error for the one-term series expansion PDF 
is slightly smaller at 0.409. The PDF generated by the 
two-term series expansion formula is shown in FIG. 15. The 
fit of the two-term approximation is significantly better than 
that of the Gaussian approximation. The two-term formula 
provides an excellent fit of the residual signal PDF, espe- 
cially near the peak in the distribution. The greatest source 
of error in the approximation occurs in the transition regions 
of the distribution, between 1 and 4 standard deviations from 
the mean. The rms error for the two-term series expansion 
PDF is 0.122, which is nearly a factor of four smaller than 
the rms error of the Gaussian PDF. The PDF generated by 
the three-term series expansion formula is shown in FIG. 16. 
The three-term formula provides a better fit of the peak in 
the distribution than does the two -term formula. The rms 
error for the three-term approximation is 0.151, which is 
slightly larger than that of the two -term approximation 
because its fit of the lower transition region is less accurate 
than that of the two-term approximation. 

The rms errors of the four calculations for each of the 
sensors in the model are listed in FIG. 17. Also included in 
FIG. 17 are the rms errors for Gaussian PDFs of the same 
mean, standard deviation, and area as the residual signal 
PDFs. As indicated by the data in the first two columns of 
FIG. 17, the one-term series expansion formula produces 
PDFs that are slightly more accurate than the corresponding 
Gaussian PDFs. Significant improvement is exhibited by the 
two-teim series expansion formula, which produces PDFs 
whose rms errors are as much as a factor of four smaller than 
those from the corresponding Gaussian and one- term series 
expansion approximations. The three-term series expansion 
formula generally produces larger rms errors than does the 
two-term formula because the fit of the transitions regions of 
the distribution is less accurate. This trend is further exhib- 
ited by the four-term series expansion formula. For three of 
the calculations (i.e., sensor numbers 1, 2, and 6), the 
four-term approximations are unstable in the transition 
regions, resulting in rms errors that are much larger than 
those of the corresponding Gaussian approximations. The 
four-term series expansion formula produces an improved 
approximation for the residual signal from sensor number 5 
only. The residual signal for this accelerometer is the most 
nearly Gaussian of the sensors to begin with, as indicated by 
the rms errors for the Gaussian approximations for all six 
sensors. These results suggest that the higher order terms in 
the series expansion should be used with caution: only those 
PDFs that are nearly Gaussian to begin with should be 
approximated with terms higher than the second term in the 
series expansion. 

In the ASP method for near-Gaussian distributions, the 
central moments of the distribution to be approximated are 
used to evaluate the coefficients in the series expansion. A 
second approach, known as the optimized ASP method for 
near-Gaussian distributions, was also evaluated. In the sec- 
ond approach, the higher-order central moments (i.e., jlx 3 , jli 4 , 
jll 5 , and ja 6 ) are treated as free parameters in the model and 
the approximation is optimized by searching for values of 
the central moments that provide the best fit of the distri- 
bution. In this approach, the simplex optimization method is 
used to minimize the rms error of the calculation, as 
described in A Simplex Method for Function Minimization, 
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by J. A. Nelder and R. Mead, Computer Journal, Vol. 7, 
1965, at pages 308 through 313. The simplex method is 
particularly useful for minimizing complicated transcenden- 
tal functions of two or more variables because it minimizes 
5 a function by utilizing evaluations of the function only — no 
evaluations of function derivatives are required. 

In the optimized ASP method for near-Gaussian distribu- 
tions, the first step is to select the approximation formula to 
be optimized, either the one, two, three, or four-term series 
to expansion formula. The number of free parameters in the 
model is determined by the approximation formula used. For 
instance, the one-term formula requires only one free param- 
eter (i.e., |x 3 ), whereas the four-term formula requires four 
free parameters (i.e., |i 3 , p 4 , p 5 , and p 6 ). The function that is 
15 minimized by the simplex algorithm is the rms error for the 
approximation, given by equation E68. The higher-order 
central moments of the distribution are used as initial values 
of the free parameters in the model. The simplex algorithm 
iterates on the free parameters until a minimum in the 
20 function is found, thereby producing the best fit (i.e., small- 
est rms error) between the calculated PDF and the residual 
signal PDF, for a given approximation formula. 

The rms errors of the calculations using the optimized 
ASP method for near-Gaussian distributions are listed in 
25 FIG. 18. Also included in FIG. 18 are the rms errors for 
Gaussian PDFs of the same mean, standard deviation, and 
area as the residual signal PDFs. Comparing the rms errors 
of the Gaussian approximations to those from the optimized 
calculations shows that all optimized series expansion for- 
30 mulas produce significant reductions in the rms error. Com- 
paring the data in FIG. 18 to those of FIG. 17 shows that the 
optimized ASP method produces more accurate approxima- 
tions than does the ASP method for each series expansion 
formula. For a given sensor, the optimized two-term series 
35 expansion formula produces rms errors that are roughly a 
factor of two smaller than those of the optimized one-term 
formula. The optimized three-term and four-term series 
expansion formulas though, produce rms errors that are 
roughly equivalent to those of the optimized two -term 
40 formula. Since the three-term and four-term models incor- 
porate more complicated functions with more free param- 
eters than the two-term model, these results indicate that the 
optimized two-teim series expansion method provides the 
best balance between accuracy and efficiency. 

45 The PDF generated by the optimized two -term series 
expansion formula for the first sensor is shown in FIG. 19. 
The optimized two-teim formula provides an excellent fit of 
the residual signal PDF across all regions of the distribution, 
including the transition regions. Comparing the results of the 
50 optimized ASP calculation in FIG. 19 to those from corre- 
sponding ASP calculations in FIG. 15 and FIG. 16, reveals 
that the main effect of the optimization is to improve the 
accuracy of the approximation through the transition regions 
of the distribution. 

55 The optimized ASP method for near-Gaussian distribu- 
tions produces an excellent fit of the residual signal distri- 
butions from the MSET model of SSME accelerometer 
signals. Because the optimized ASP method is tuned to the 
specific data distributions analyzed, the empirical error 
60 probabilities for the method will better adhere to the theo- 
retical limits for the fault detection procedure. 

A parametric study of the empirical false alarm rate was 
performed to compare the false alarm of the SPRT method 
to the new ASP method for the SSME residual signals. In the 
65 study, the residual signals generated by applying the six 
sensor MSET model to the accelerometer signals from flight 
STS057 engine 1 were analyzed with the SPRT and ASP 
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sequential hypothesis tests at various values of the false 
alarm probability and the system disturbance magnitude 
parameters. Because of the high frequency of the acceler- 
ometer signals (10.24 kHz) and long duration of the flight 
(542.4 sec), the residual signals contain over five million 5 
data values. The residual signal data set was so large that 
every tenth point was extracted from the signals and the fault 
detection models were applied to the subset. The subset of 
the residual signals was large enough to produce accurate 
statistics but small enough to permit multiple calculations in to 
a reasonable time span. Typically, the sequential hypothesis 
tests make a decision for every 1 to 10 data points analyzed. 
Thus for the parametric calculations of the SSME residual 
signals, hundreds of thousands of decisions were made for 
each signal. By dividing the number of fault decisions from 15 
a SPRT analysis of a residual signal into the total number of 
fault and normal decisions made by the test, the empirical 
false alarm rate could be accurately evaluated. 

Referring to FIG. 20, the results from the application of 
the positive mean SPRT and ASP tests to the residual signal 20 
from the first sensor are shown. The calculations shown in 
FIG. 20 were performed with a constant preassigned missed 
alarm probability (p) of 0.01, a constant system disturbance 
magnitude (M) of 6, and a variable preassigned false alarm 
probability (a). The solid diagonal line in the figure repre- 25 
sents the theoretical upper bound for the empirical false 
alarm probability, as defined by equation E39. For the SPRT 
calculation, the empirical false alarm rate satisfies the theo- 
retical upper bound only for preassigned false alarm prob- 
abilities greater than 0.003. For smaller values of the pre- 30 
assigned false alarm probability, the SPRT model reaches 
more fault decisions than one would expect based on theo- 
retical arguments. The theoretical upper bound for the 
empirical false alarm probability is not met for all values of 
the preassigned false alarm probability because the residual 35 
signals are not purely Gaussian, as confirmed in FIG. 13. 
The residual signal distribution has more data populating the 
tail regions than does a Gaussian distribution of the same 
mean, standard deviation, and area as the residual signal 
distribution. The data from the tail regions of a distribution 40 
are a major source of false alarms in the SPRT calculations, 
because they are more representative of the alternative 
hypothesis than they are of the normal hypothesis. Because 
the residual signals exhibit more heavily populated tail 
regions than true Gaussian distributions, they have a ten- 45 
dency to trigger more false alarms than anticipated, espe- 
cially at high levels of sensitivity (i.e., small values of the 
false alarm probability). Because the optimized ASP method 
is tuned to the specific data distributions exhibited by the 
residual signals, the empirical false alarm rate for the ASP 50 
calculation satisfies the theoretical upper bound for all 
values of the preassigned false alarm probability. The ASP 
method produces fewer false alarms than the SPRT method 
for all values of the preassigned false alarm probability. 

ASP Benefits and Applications: 55 

The Adaptive Sequential Probability (ASP) technique was 
developed as the fault detection element of a software 
program that reliably detects signal data faults for an asset, 
such as a process and/or apparatus. These signal validation 
modules improve safety, reduce maintenance cost, and 60 
enable optimal performance for a wide range of aeronauti- 
cal, industrial, chemical, power generating, medical, bio- 
logical, financial, and military assets. Signal validation is 
required in all types of process critical control and safety 
systems where unexpected process interruptions due to 65 
sensor or control component failures or false alarms are 
unsafe or uneconomical. Signal validation assures the safe, 
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reliable operation of a process or apparatus and reduces the 
manpower, schedule and uncertainty associated with sensor 
and component failure detection. Signal validation prevents 
a facility safety or control system from making critical 
decisions, such as the decision to shut down a process or 
abort a mission, on the basis of bad sensor data. Signal 
validation improves process quality and efficiency by ensur- 
ing that closed loop control or data analysis is performed 
using good data. Finally, signal validation increases system 
up time and decreases system maintenance cost by enabling 
sensor calibration using on-condition criteria rather than 
time-in- service criteria. 

Moreover, having thus described the invention, it should 
be apparent that numerous structural modifications and 
adaptations may be resorted to without departing from the 
scope and fair meaning of the instant invention as set forth 
hereinabove and as described herein below by the claims. 

We claim: 

1 . A method for performing surveillance of an asset, the 
steps including: 

acquiring residual data correlative to expected asset 
operation; 

fitting a mathematical model to the acquired residual data 
and storing the mathematical model in a memory 
means; 

collecting a current set of observed signal data values 
from the asset; 

using the mathematical model in a sequential hypotheses 
test to determine if the current set of observed signal 
data indicates unexpected operation of the asset that is 
indicative of a fault condition; 

outputting a signal correlative to a detected fault condition 
for providing asset surveillance. 

2. The method of claim 1 wherein the step of acquiring 
residual data includes a step of acquiring residual data 
correlative to at least one mode of asset operation. 

3. The method of claim 2 wherein the step of acquiring 
residual data correlative to at least one mode of asset 
operation includes a step of acquiring past observations of 
signal data values from the asset and obtaining deviations 
between the past observations of signal data values and 
expected signal values obtained by using a model of 
expected asset operation. 

4. The method of claim 1 wherein the step of fitting the 
mathematical model to the acquired residual data and storing 
the mathematical model in the memory means includes a 
step of fitting at least one mathematical function correlative 
to a frequency distribution to the acquired residual data. 

5. The method of claim 4 wherein the step of fitting at 
least one mathematical function correlative to a frequency 
distribution to the acquired residual data includes a step of 
fitting at least one function defining a frequency curve to the 
acquired residual data. 

6. The method of claim 4 wherein the step of fitting at 
least one mathematical function correlative to a frequency 
distribution to the acquired residual data includes a step of 
fitting at least one function defining a frequency histogram 
to the acquired residual data. 

7. The method of claim 4 wherein the step of fitting at 
least one mathematical function correlative to a frequency 
distribution to the acquired residual data includes a step of 
fitting at least one function defining a frequency polygon to 
the acquired residual data. 

8. The method of claim 1 wherein the step of using the 
mathematical model in the sequential hypotheses test to 
determine if the current set of observed signal data indicates 
unexpected operation of the asset that is indicative of a fault 
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condition further includes the step of performing at least one 
of a group of sequential hypothesis tests comprised of a 
positive mean test, a negative mean test, a nominal variance 
test, and a inverse variance test all as a function of the 
mathematical model. 

9. The method of claim 8 wherein the step of performing 
at least one of the group of sequential hypothesis tests 
comprised of the positive mean test, the negative mean test, 
the nominal variance test, and the inverse variance test all as 
a function of the mathematical model includes a step of 
processing the at least one of the group of sequential 
hypothesis tests and comparing an outcome of each pro- 
cessed test to upper and lower threshold limits for deter- 
mining unexpected operation of the asset that is indicative of 
a fault condition. 

10. The method of claim 1 further including a step of 
determining if an outputted signal correlative to a detected 
fault condition necessitates a control action. 

11. The method of claim 1 further including a step of 
determining if an outputted signal correlative to a detected 
fault condition necessitates an alarm action. 

12. The method of claim 2 where the step of acquiring 
residual data correlative to at least the one mode of asset 
operation includes a step of transforming the residual data 
values to remove noise from the residual data values. 

13. The method of claim 12 wherein the step of trans- 
forming the residual data values to remove noise from the 
residual data values includes a step of transforming the 
residual data values to remove correlated noise from the 
residual data values. 

14. The method of claim 12 wherein the step of trans- 
forming the residual data values to remove noise from the 
residual data values includes a step of transforming the 
residual data values to remove uncorrelated noise from the 
residual data values. 

15. The method of claim 1 wherein the asset is a rocket 
engine. 

16. The method of claim 1 wherein the asset is a rocket 
engine turbo pump. 

17. The method of claim 1 wherein the step of acquiring 
residual data correlative to expected asset operation includes 
a step of acquiring observed signal data values from an 
accelerometer. 

18. The method of claim 1 wherein the step of using the 
mathematical model in the sequential hypotheses test further 
includes a step of determining if a determined unexpected 
operation of the asset that is indicative of a fault condition 
is a degraded signal. 

19. The method of claim 1 wherein the step of using the 
mathematical model in the sequential hypotheses test further 
includes a step of determining if a determined unexpected 
operation of the asset that is indicative of a fault condition 
is a degraded sensor. 

20. The method of claim 1 wherein the step of using the 
mathematical model in the sequential hypotheses test further 
includes a step of determining if a determined unexpected 
operation of the asset that is indicative of a fault condition 
is a degraded component. 

21. The method of claim 1 where the step of using the 
mathematical model in the sequential hypotheses test further 
includes a step of determining if a determined unexpected 
operation of the asset that is indicative of a fault condition 
is an operating anomaly. 

22. The method of claim 1 where the step of fitting the 
mathematical model to the acquired residual data and storing 
the mathematical model in the memory means includes a 
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step of selecting the parameters in the mathematical model 
that provide a best fit to a frequency distribution of the 
acquired residual data. 

23 . The method of claim 22 where the step of selecting the 
5 parameters in the mathematical model that provide the best 
fit to the frequency distribution of the acquired residual data 
includes a step of performing a optimization procedure to 
minimize the error between the mathematical model and the 
frequency distribution of the acquired residual data, 
to 24. The method of claim 23 wherein the optimization 
procedure used to minimize the error between the math- 
ematical model and the frequency distribution of the 
acquired residual data is a simplex optimization procedure. 

25. A method for performing surveillance and control of 
15 an asset, the steps including: 

acquiring residual data correlative to expected asset 
operation; 

fitting a mathematical model to the acquired residual data 
and storing the mathematical model in a memory 
20 means; 

collecting a current set of observed signal data values 
from the asset; 

using the mathematical model in a sequential hypotheses 
test to determine if the current set of observed signal 
25 data indicates an unexpected operation of the asset that 
is indicative of a fault condition; and 

determining if a determined unexpected operation of the 
asset necessitates a control action. 

26. The method of claim 25 further including a step of 
30 performing a control action when a control action is deter- 
mined to be necessitated by the determining step. 

27. The method of claim 25 further including a step of 
determining if a determined unexpected operation of the 
asset necessitates an alarm action. 

35 28. The method of claim 27 further including a step of 

sounding an alarm when an alarm action is determined to be 
necessitated by the determining step. 

29. A system for performing surveillance of an asset, said 
system comprising in combination: 

40 means for acquiring residual data correlative to expected 
asset operation; 

means for fitting a mathematical model to said acquired 
residual data and storing said mathematical model in a 
memory means; 

45 means for collecting a current set of observed signal data 
values from the asset; 

means for using said mathematical model in a sequential 
hypotheses test to determine if said current set of 
observed signal data indicates unexpected operation of 
50 the asset that is indicative of a fault condition; 

a communication means for communicating detected fault 
conditions to a remote location for providing asset 
surveillance. 

30. The combination of claim 29 further including means 
55 for determining if an outputted signal correlative to a 

detected fault condition necessitates a control action. 

31. The combination of claim 29 further including means 
for determining if an outputted signal correlative to a 
detected fault condition necessitates an alarm action. 

60 32. The combination of claim 29 wherein said means for 

fitting said mathematical model to said acquired residual 
data and storing said mathematical model in said memory 
means includes means for selecting parameters in said 
mathematical model that provide a best fit to a frequency 
65 distribution of said acquired residual data. 

33. The combination of claim 29 wherein said means for 
using said mathematical model in said sequential hypotheses 
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test to determine if said current set of observed signal data 
indicates unexpected operation of the asset that is indicative 
of a fault condition further includes means for performing at 
least one of a group of sequential hypothesis tests comprised 
of a positive mean test, a negative mean test, a nominal 5 
variance test, and a inverse variance test all as a function of 
said mathematical model. 

34. The combination of claim 33 wherein said means for 
performing at least one of said group of sequential hypoth- 


40 

esis tests comprised of said positive mean test, said negative 
mean test, said nominal variance test, and said inverse 
variance test all as a function of said mathematical model 
includes means for processing at least one of said group of 
sequential hypothesis tests and comparing an outcome of 
each processed test to upper and lower threshold limits for 
determining unexpected operation of the asset that is indica- 
tive of a fault condition. 



